Flash-MoE

  • AI Tutorials

    Running 400B Parameter AI Models on a Smartphone

    A technical breakdown of how Flash-MoE and Apple's 'LLM in a Flash' research enabled a 400-billion-parameter model to run on an iPhone, and what it means for the future of hybrid AI applications.