📄️ Qwen2.5-VL
Complete guide for deploying the Qwen2.5-VL multimodal model (3B and 7B variants) on BOS Eagle-N hardware. This guide covers model setup, Hugging Face authentication, inference execution, batch processing, and performance profiling with Tracy.
📄️ Llama-3.2
Comprehensive guide for running Meta's LLaMA 3.2 language models (1B, 3B, and 8B variants) on BOS Eagle-N hardware. Covers Hugging Face setup, authentication, model execution, batch processing, and performance analysis using profiling tools.