Scaling Model Training with Multi-Node GPU Clusters | 16 × AI