On-Device AI

Explore our entire collection of insights, tutorials, and industry news.

  • AI Tutorials

    How to Run a 400B Parameter LLM on a Phone

    Discover the engineering breakthroughs behind running massive 400B models like DeepSeek-V3 or Llama 3 on mobile hardware using flash offloading and quantization.
    Read more