RecAccel™ N3000 for DLRM
The Neuchips RecAccel™ N3000 accelerates deep learning recommendation systems by combining patented FFP8 technology with an advanced hardware architecture. This PCIe Gen5 solution delivers over 20M inferences per second at 20 watts, achieves >99.95% accuracy, and offers 1.7x better performance per watt. Built on a TSMC 7nm process and supported by a comprehensive software ecosystem, the N3000 sets new standards for AI acceleration in modern data centers.
RecAccel N3000 Specification | |
---|---|
Embedded ARC HS48 Processors | |
Embedded ARC EV72 Processors | |
Communication Interface | PCIe Gen 5 x8 |
Memory | |
AI Accelerators | |
State-of-the-art Hardware Solution for AI Recommendation System Acceleration
Boost the power of your AI recommendation systems with the RecAccel™ N3000 PCIe card. This dual-slot PCI Express Gen5 card delivers exceptional performance and reliability, making it well suited to the most demanding elastic data centers. Powered by the NEUCHIPS RecAccel™ N3000 series AI chip, the card offers powerful acceleration capabilities that take your AI recommendation systems to the next level. Experience the ultimate in speed, performance, and reliability, and unlock the full potential of accelerated AI for your business.
Driven by Deep Understanding of Cloud Recommendation
The domain-specific architecture for cloud recommendation is the result of deep insight into how compute-bound, latency-bound, memory-bound, energy-bound, and accuracy-bound requirements interact. By combining algorithmic optimizations, hardware acceleration, data caching, and power management, the design delivers high performance and efficiency, setting a new standard for cloud recommendation systems.
Industry Leading Results for MLPerf™ DLRM Inference Benchmarking
The RecAccel™ N3000 has proven itself to be an industry leader in both performance and power efficiency. In MLPerf™ v3.0, the RecAccel™ N3000 system delivered 1.7 times better performance per watt for DLRM inference while maintaining 99.9% accuracy with the help of its proprietary INT8 calibrator.
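The proprietary INT8 calibrator itself is not described here. As a rough illustration of what INT8 calibration means in general, the following Python sketch uses a simple symmetric max-abs scheme: it derives one scale factor from calibration samples, quantizes to signed 8-bit integers, and measures the round-trip error. The function names and sample values are hypothetical and do not reflect the Neuchips implementation.

```python
def calibrate_scale(samples, num_bits=8):
    """Derive a symmetric scale from the largest magnitude in the calibration data."""
    qmax = 2 ** (num_bits - 1) - 1  # 127 for INT8
    max_abs = max(abs(x) for x in samples)
    return max_abs / qmax if max_abs > 0 else 1.0


def quantize(values, scale, num_bits=8):
    """Map floats to signed integers, clamping to the representable range."""
    qmax = 2 ** (num_bits - 1) - 1
    return [max(-qmax, min(qmax, round(v / scale))) for v in values]


def dequantize(q_values, scale):
    """Map quantized integers back to approximate floats."""
    return [q * scale for q in q_values]


# Hypothetical calibration samples, chosen only for illustration.
calibration_data = [-1.27, -0.5, 0.0, 0.25, 1.0]

scale = calibrate_scale(calibration_data)
q = quantize(calibration_data, scale)
restored = dequantize(q, scale)

# For in-range values, the round-trip error is bounded by half a quantization step.
max_error = max(abs(a - b) for a, b in zip(calibration_data, restored))
print(q, max_error)
```

Production calibrators typically choose scales per tensor or per channel and may clip outliers instead of using the raw maximum, trading a little range for finer resolution.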
SW-HW Co-design Achieves Linear Multicard Scalability
During system testing in MLPerf™ v3.0, the RecAccel™ N3000 delivered exceptional performance, with nearly 100% scaling efficiency as cards were added. In other words, system throughput increased almost linearly with the number of cards, allowing for seamless scalability.
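Scaling efficiency is commonly expressed as measured multi-card throughput divided by the ideal throughput (single-card throughput times the number of cards). The Python sketch below shows that calculation. The throughput values are hypothetical placeholders, not MLPerf™ results.

```python
def scaling_efficiency(single_card_throughput, multi_card_throughput, num_cards):
    """Return measured throughput as a fraction of ideal linear scaling."""
    if num_cards < 1 or single_card_throughput <= 0:
        raise ValueError("num_cards must be >= 1 and single-card throughput > 0")
    ideal_throughput = single_card_throughput * num_cards
    return multi_card_throughput / ideal_throughput


# Hypothetical example: one card is normalized to 1.0 throughput units,
# and an 8-card system is assumed to measure 7.9 units in total.
efficiency = scaling_efficiency(1.0, 7.9, 8)
print(f"Scaling efficiency: {efficiency:.2%}")
```

A value close to 1.0 (100%) indicates that adding cards introduces little overhead from host coordination or shared-resource contention.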
- Achieves 20 million inferences per second at just 20 watts
- 50% reduction in off-chip memory access
- 30% improvement in bandwidth utilization
- Industry-leading power efficiency for DLRM workloads
- Achieves 99.97% of FP32 accuracy with INT8
- Improves to 99.996% accuracy with FFP8
- Maintains high accuracy while reducing power consumption
- Optimized for production-grade recommendation models
- Available in dual M.2 modules
- Supports PCIe Gen 5 cards
- Scalable for various data center configurations
- Automated optimization path
- Easy integration with existing systems
- Comprehensive development tools and support
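The headline figure in the list above (20 million inferences per second at 20 watts) can be restated as energy efficiency. Since one watt is one joule per second, it works out to one million inferences per joule, or about one microjoule per inference. The short Python sketch below performs that arithmetic using only the two quoted numbers. The variable names are illustrative.

```python
# Headline figures quoted above.
inferences_per_second = 20_000_000
power_watts = 20.0  # 1 watt = 1 joule per second

# Throughput per unit of power equals inferences per joule.
inferences_per_joule = inferences_per_second / power_watts

# Energy spent on a single inference, converted to microjoules.
energy_per_inference_uj = power_watts / inferences_per_second * 1e6

print(f"{inferences_per_joule:,.0f} inferences per joule")
print(f"{energy_per_inference_uj:.2f} microjoules per inference")
```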