VXI

VELOX INTELLIGENCE

22nm SRAM-CIM AI ACCELERATOR

Velox Intelligence · Hangzhou VXI Technology

Purpose-Built Brain
for Edge AI Deployment

VXI 22nm SRAM-CIM × 1.5B LLM @ 5W → 35-45 tok/s

Mesh NoC + 64 MB CIM + 32 MB on-chip KV = The physics solution to memory-bound crisis

Market Pain Points

Three Bottlenecks of On-Device LLM

In 2024-2026, the on-device wave of DeepSeek, Qwen, Llama rewrites the rules: the bottleneck shifts from 'insufficient compute' to 'insufficient memory bandwidth'.

Bandwidth Saturation

Under LPDDR5's 25.6 GB/s physical ceiling, 1.5B model decode theoretically maxes at 33 tok/s, real-world often drops to 5-8 tok/s.

30x Efficiency Collapse

Traditional NPUs claim 30 TOPS/W, but in LLM scenarios, after deducting LPDDR data movement power, it collapses to < 0.5 TOPS/W.

5W Thermal Barrier

System power surges to 10-15W, making fanless enclosures, robot dogs, and palm-sized private cloud devices impossible to deploy.

Solution

The Physics Solution to Memory-Bound Crisis

Mesh NoC + 64 MB CIM + 32 MB on-chip KV

VXI 22nm SRAM-CIM SoC

System-Level Efficiency

27.8 TOPS/W

128 CIM Tiles × 512 KB

64 MB CIM array, weight-stationary + in-situ MAC, on-chip interconnect TB/s bandwidth

Mesh NoC 2 GHz, 256-bit

Unidirectional 64 GB/s cross-tile bandwidth, far exceeding LPDDR5's 25.6 GB/s ceiling

32 MB High-Density SRAM

16 MB KV Cache + 8 MB Activation Buffer + 8 MB Scratchpad, 8K context on-chip

Explore Architecture

0

CIM Tiles

0 MB

CIM Capacity

0 W

True Embedded TDP

0 tok/s

1.5B Decode Rate

Applications

From Personal Cloud to Edge Server

Personal Private Cloud / Me-Box

Personal Private Cloud / Me-Box

5W passive cooling + 1.5B fluent conversation + Web3 identity signing, data never leaves the shell.

Explore Scenarios
Xinchuang Desktop / Workstation

Xinchuang Desktop / Workstation

22nm domestic production, autonomous & controllable, SM4 hardware encryption, DeepSeek 7B becomes desktop standard.

Explore Scenarios
Industrial / Robot Dog Brain

Industrial / Robot Dog Brain

Wide-temp fanless, real-time inference + VLA vision-language-action model, the core brain of embodied intelligence.

Explore Scenarios

Product Roadmap

Tape-out Status & Milestones

2026 Q4

Master SoC Tape-out

2027 Q2

ASIC Bring-up

2027 Q4

VXI-EM5/XC8 Mass Production

2028 Q1

VXI-IR5 Mass Production

Specifications on this page are pre-silicon design targets. VXI 22nm SRAM-CIM ASIC is planned for Q4 2026 tape-out / Q2 2027 bring-up.

Ready to Start Evaluation?

VXI8805 FPGA evaluation board is now available, bundled with SDK + pre-quantized model packages for immediate software stack validation and model PoC.