[Ofmfwg] Sunfish meeting notes for September 5, 2025
Lee, Peter
peter.lee at necam.com
Fri Sep 5 08:50:56 PDT 2025
* Flux NVMe-oF Use Case
* Mike is continuing to work on the code but has run into two issues.
* First issue: Mike cannot mount NVMe devices from a remote NVMe server, probably because he did not use the right sequence of commands. Phil shared a link with the steps for mounting remote NVMe devices, and Mike will try again.
* https://enterprise-support.nvidia.com/s/article/howto-configure-nvme-over-fabrics#jive_content_id_NVME_Target_Configuration
* Second issue is Flux-related: Mike runs multiple Flux instances and all of them show up as broker 0, when only one of them should be broker 0. Mike will ask for help.
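
For the NVMe mounting issue, the usual initiator-side order of operations with nvme-cli is: load the transport module, discover the target, connect to the subsystem, list the new block devices, then mount. The sketch below only builds and prints those commands; it does not run them. The address, NQN, device name, and mount point are hypothetical placeholders, and it assumes an RDMA or TCP transport and a namespace that already carries a filesystem. The linked article remains the reference for the actual setup.

```python
import shlex


def nvmeof_mount_commands(addr, nqn, device, mountpoint,
                          transport="rdma", port=4420):
    """Return argv lists for a typical NVMe-oF initiator sequence.

    Nothing is executed here. All argument values are supplied by the
    caller; the defaults (rdma, port 4420) are assumptions.
    """
    if transport not in ("rdma", "tcp"):
        raise ValueError("this sketch only covers rdma and tcp transports")
    port = str(port)
    return [
        # Load the fabric transport module on the initiator (client) side.
        ["modprobe", f"nvme-{transport}"],
        # Ask the target which subsystems it exports.
        ["nvme", "discover", "-t", transport, "-a", addr, "-s", port],
        # Attach one subsystem by its NQN; a new /dev/nvmeXnY should appear.
        ["nvme", "connect", "-t", transport, "-n", nqn, "-a", addr, "-s", port],
        # Confirm which device node the remote namespace was given.
        ["nvme", "list"],
        # Mount it (assumes the namespace already holds a filesystem).
        ["mount", device, mountpoint],
    ]


if __name__ == "__main__":
    # Hypothetical values for illustration only.
    for argv in nvmeof_mount_commands(
        addr="192.0.2.10",
        nqn="nqn.2014-08.org.example:subsys1",
        device="/dev/nvme0n1",
        mountpoint="/mnt/nvme",
    ):
        print(shlex.join(argv))
```

Skipping the connect step, or mounting before checking `nvme list` for the actual device name, is a common reason the remote device never shows up on the client.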
* Flux/Sunfish for Datacenter
* Mike showed how Flux and Sunfish can manage an entire datacenter. Flux is hierarchical. Flux and Sunfish can allocate and manage resource pools for other clusters running Flux, Slurm, Kubernetes, etc. CI/CD pipelines can talk to a central point (broker 0), which creates the resource pools and routes them to the appropriate cluster(s).
* [Attached image: image001.png]
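
The datacenter idea in the notes (a central point that creates resource pools and routes them to clusters) can be pictured with a toy model. This is only an illustration of the routing concept, not the Flux or Sunfish API: the class names, the node-count accounting, and the cluster names are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Cluster:
    """A downstream cluster that receives resource pools (hypothetical)."""
    name: str
    scheduler: str  # e.g. "flux", "slurm", "kubernetes"
    pools: list = field(default_factory=list)


@dataclass
class CentralPoint:
    """Stands in for the central point (broker 0) described in the notes."""
    free_nodes: int
    clusters: dict = field(default_factory=dict)

    def register(self, cluster):
        self.clusters[cluster.name] = cluster

    def request_pool(self, cluster_name, nodes):
        """Carve a pool out of the free nodes and route it to one cluster."""
        if cluster_name not in self.clusters:
            raise KeyError(f"unknown cluster: {cluster_name}")
        if nodes > self.free_nodes:
            raise ValueError("not enough free nodes")
        self.free_nodes -= nodes
        pool = {"cluster": cluster_name, "nodes": nodes}
        self.clusters[cluster_name].pools.append(pool)
        return pool


if __name__ == "__main__":
    root = CentralPoint(free_nodes=16)
    root.register(Cluster("hpc-a", "slurm"))
    root.register(Cluster("cloud-b", "kubernetes"))
    # A CI/CD pipeline would issue requests like these to the central point.
    print(root.request_pool("hpc-a", 6))
    print(root.request_pool("cloud-b", 4))
    print("free nodes left:", root.free_nodes)
```

In a real deployment the pools would be composed from actual hardware and the clusters would be live scheduler instances; the model only shows the single entry point and the per-cluster routing.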
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image001.png
Type: image/png
Size: 947483 bytes
Desc: image001.png
URL: <http://lists.openfabrics.org/pipermail/ofmfwg/attachments/20250905/154bc05e/attachment-0001.png>