Skip to main content

Overview

From model development to deployment—build and optimize AI models on Eagle-N with a unified, production-ready software stack.

The AI Model SDK provides a comprehensive toolkit for developing and optimizing AI models on Eagle-N. It enables seamless support for conversational AI, perception models, and autonomous systems by bridging model development, compilation, and deployment across both host systems and the Eagle-N neural network accelerator.

Designed for efficiency and flexibility, the SDK streamlines the entire pipeline—from model preparation to high-performance on-device execution.

What sets our model SDK apart

  • Fully open-source foundation
    Unlike conventional NPU SDKs, Eagle-N is built on an open-source stack—giving developers full visibility, control, and freedom from vendor lock-in.

  • Custom operator & kernel development
    Easily develop your own operators and kernels to support new models or specialized workloads.

  • Performance-driven customization
    Modify and optimize existing operators and kernels to maximize performance for your specific use cases.

  • Deep hardware-level control
    Go beyond high-level abstractions—directly tune hardware-specific components for full-stack optimization. While this requires architectural expertise, our team provides dedicated support to help you fully leverage Eagle-N.

  • Rich model zoo & rapid model enablement
    Access a wide range of reference models and quickly adopt newly emerging AI models from industry and academia, supported by a global network of model development experts.

BOS NN Accelerator Apart


Designed for Developers

The SDK is built with developer productivity and system-level integration in mind:

  • Clean and modular software architecture spanning host and accelerator
  • Support for standard environments (Linux-based BSP, common programming models)
  • Runtime, driver, and toolchain integration for end-to-end workflows
  • Built-in profiling, logging, and observability for performance tuning
  • Support for asynchronous execution and efficient data movement

This design allows developers to move from experimentation to production while maintaining visibility into system behavior and performance.


Core Features

Unlock maximum performance on Eagle-N with hardware-aware optimizations and full-stack control. The SDK enables developers to go beyond standard frameworks and directly tune execution down to the NPU level, without black-box limitations.


Advanced Model Performance Optimization

  • Parallel execution across multi-cluster NPU architecture
  • High-bandwidth LPDDR5/5X memory utilization for data-intensive workloads
  • Support for multiple data formats (INT8, FP16, FP8) with on-the-fly conversion
  • Efficient data movement through NoC and DMA engines
  • Fine-grained control of low-level hardware operations
  • Optimized memory flow across Tensix cores
  • Custom compute kernel optimization for specific workloads

These capabilities enable low-latency, high-throughput inference while giving developers full control to extract the maximum performance from Eagle-N using transparent and open tooling.


Custom Ops Development Support

Extend the SDK with custom operations and pipelines tailored to your application:

  • Build reusable kernels and model components aligned with NPU execution model
  • Define specialized pipelines for perception, language, or multimodal workloads
  • Integrate proprietary logic while maintaining compatibility with the SDK runtime

This flexibility enables fine-grained control over performance, scalability, and deployment behavior across different system configurations.

AI Models Workflow

AI Workload Processing Flow

Eagle-N is optimized for processing complex AI workloads such as video and text analysis:

  • The host AP sends input data (camera, sensor, or text streams) to Eagle-N via PCIe
  • Eagle-N executes neural network inference using its multi-cluster NPU
  • Results are returned to the host AP for further decision-making or system integration

This division of responsibilities ensures efficient workload distribution and maximizes overall system performance.


Funtional Safety Integration

For automotive applications, Eagle-N integrates with a dedicated Safety MCU to meet functional safety requirements:

  • Monitors system health and detects operational faults
  • Sends alerts to the Safety MCU in case of errors or abnormal behavior
  • Enables rapid system-level response to maintain safe operation

This safety mechanism is critical for autonomous driving systems, ensuring reliability under all operating conditions.


For different usage models or advanced requirements please contact BOS through the support