AI Tutorials
TitanCore Core-1 LLM Training Infrastructure with C++ CUDA and ZeRO-3
Explore TitanCore Core-1, a high-performance C++/CUDA infrastructure designed for trillion-parameter LLM training using ZeRO-3 and custom fused kernels for 2.6x speedup.
Read more →