← All contributors
TI
contributor
Toma Iliescu
@toma
Training infrastructure for research-scale workloads. Spends most of his time figuring out why a run that worked on 8 GPUs falls over on 64. Currently working on FSDP recipes and large-scale debugging tooling.
1 article
Training focus
1 article