Viper Series LLM Inference Card

The Neuchips Viper series is an enterprise-focused offline AI solution featuring the Raptor N3000 Gen AI processor, advanced data compression, and 64GB onboard memory. It specializes in secure, efficient RAG applications while maintaining complete data privacy through local processing.

Overview

Neuchips Products	Raptor Series Gen Al Processor Viper Series Gen Al PCIe Card
Offline Al Solution	Marketing &Sales	Operations	IT&Engineering	IT&Risk & Legal	HR	Utility & Manufacturing
Domain Focus Al Application	Create ProductLiteratures Analyze Customer Feedback Customer Support Service...	Identify Production Yield Rate Automatically Process & Agent Document Analysis	Create Technical Documentation Automatically Generate Data	Draft & Summarize Legal Documents Summarize and Highlight...	Assist in Interview HR Training System Candidate Assessment ...	Search & Question Answering Optimize Employee Communication Presentation Foil Creation Document Extraction & Data Analysis
System Integrator	HW & SW System Integration Service \| Domain Specific API Management
Software Service Provider / Application Interface	Device Management Platform \| Fine Tuning \| Document Partner \| RAG... etc.
Open Soruce Al Models	Gen Al/ LLM Models (Llama3.2 \| Mistral NeMo \| Phi 3.5 \| TAIDE \| Breeze)
System Hardware Manufacturer	PC \| IPC \| Workstation \| Server

Easy-to-Use SDK

Neuchips Raptor Series
User Application
Software Development Kit (SDK)
	Compiler	Neuchips PyTorch Extension
		FFP8 quantization
		Operator optimization
		Graph partition
		Graph optimization
		Memory planning
	Shared Kernel	Custom kernels
	Runtime

Transform Your Enterprise AI
Securely and Seamlessly

The Viper GenAI PCIe card represents a breakthrough in enterprise AI solutions, combining powerful hardware acceleration with practical business functionality. At its core, it features the Raptor Gen AI processor with specialized embedding engines that deliver 10x efficiency improvements. The system's 32 GB onboard memory serves as a dedicated vector database, enabling secure local data processing while significantly reducing CPU overhead. By supporting multiple open-source LLM models and offering seamless integration with existing infrastructure, Viper enables enterprises to implement AI capabilities without compromising data security or requiring extensive system overhauls. This makes it particularly valuable for organizations seeking to leverage AI technology while maintaining complete control over their sensitive data and operational processes.

Specification

Viper Series Specification
Specification	Description
Support LLM Model	Llama3.2, Mistral NeMo, Phi3.5, Breeze, TAIDE
Total Board Power	Min. = 25W Default = 45W Max. = 75W
Thermal Solution	Active and Passive Cooling Available
Mechanical Form Factor	HHHL-SS (half-height, half-length, single-slot)
Memory Type	LPDDR5
Memory Size	Up to 64GB
Memory Clock	6400 Mbps
Ambient Operating Temperature	0°C to 50°C
Storage Temperature	-40 °C to 75°C
Operating Humidity	5% to 85% Relative Humidity
Storage Humidity	5% to 95% Relative Humidity

Key Features

Embedded Vector Processing Engine

Built-in with efficient embedding engine
Reduces communication overhead between the card and host CPU
Provides 10x improvement in efficiency for vector similarity searches
Features 64GB LPDDR5 memory capacity, supporting up to 12B LLM models in a single chip and card configuration

32GB On-Device Vector Storage