Zachary Mueller’s Post

View profile for Zachary Mueller, graphic

Technical Lead for Accelerate at HuggingFace

We're working on getting Hugging Face Accelerate 1.0.0 up and going, and decided to publish our roadmap publicly to get your thoughts, opinions, and just keep you in the loop! Check out more of what we're thinking: https://lnkd.in/er7_ZeRw Still learning the best ways to do things, for now the project has links to the relevant Accelerate issues once we've reached a point we can start discussing them. Please follow those to voice your thoughts! 🤗

Parth Saini

Senior Software Engineer | Gen AI | Cloud | IIT

5mo

Does accelerate multinode training work from inside containers hosted on different nodes?

Like
Reply
Parth Saini

Senior Software Engineer | Gen AI | Cloud | IIT

5mo

Containers are given host network I guess then. Pytorch runs on nccl backend. Nccl isn't able to find out eth0 networks with overlay network.

Like
Reply
See more comments

To view or add a comment, sign in

Explore topics