Contact_

Request a Customized Business Quotation

Phone

+886 2 8797 8337

HQ

12F-2, No.408, Ruiguang Rd., Neihu Dist., Taipei City 11492, Taiwan

Branch

Center of Innovative Incubator R819, No. 101, Section 2, Kuang-Fu Road, Hsinchu, Taiwan


The Industrialization of AI

Skymizer’s system software solutions enable AI-on-Chip design houses to automate AI application development,
improve system performance, and optimize inference accuracy.
01_

Our Key Values

Automate Application Development

Enable AI SoCs to be easily adopted in rapidly evolving applications by automatically compiling AI algorithms into the chip's machine code.

Analyze System Bottlenecks

Leverage virtual platforms to conduct Performance-Guided Optimization (PGO), improving speed and reducing memory requirements by utilizing all available computing and memory resources in complex heterogeneous multicore systems.
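The kind of bottleneck analysis described above can be illustrated with a simple roofline-style estimate. This is a generic sketch of the technique, not Skymizer's actual tooling, and all hardware numbers are illustrative:

```python
def bottleneck(macs, bytes_moved, peak_macs_per_s, peak_bytes_per_s):
    """Classify an operator as compute- or memory-bound (roofline model)."""
    compute_time = macs / peak_macs_per_s
    memory_time = bytes_moved / peak_bytes_per_s
    kind = "memory-bound" if memory_time > compute_time else "compute-bound"
    return kind, max(compute_time, memory_time)

# Illustrative accelerator: 4 TMAC/s compute, 8 GB/s DRAM bandwidth.
PEAK_MACS = 4e12
PEAK_BW = 8e9

# A layer with few MACs per byte moved lands on the memory side of the roofline.
kind, t = bottleneck(macs=2e6, bytes_moved=4e6,
                     peak_macs_per_s=PEAK_MACS, peak_bytes_per_s=PEAK_BW)
print(kind)  # memory-bound
```

An operator classified as memory-bound gains nothing from more MAC units; only better data reuse or bandwidth helps, which is exactly what this style of analysis surfaces before silicon is fixed.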

Enhance Accuracy and Performance through Hardware/Software Co-optimization

Align software development with SoC architecture exploration at a much earlier stage. Provide architecture-aware calibrations to maintain precision even in Int8 mode.

02_

Our Belief

LLM as Essential Interface

LLM will become a crucial interface for human-to-machine and machine-to-machine communication.

Edge Inference and Fine-Tuning Growth

Inference and fine-tuning are transitioning to the edge and are poised for much greater growth than training.


Key benefits include security, privacy, personalization, response time, and cost-effectiveness.

Dedicated Language Processor Unit (LPU)

LPUs offer superior performance, power efficiency, and cost-effectiveness compared to CPU/GPU/NPU solutions.

03_

EdgeThought

High Performance and Low Cost

High memory bandwidth utilization.

Shortest response time with minimal MAC requirement.
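Why memory bandwidth utilization dominates: in single-batch LLM decoding, every weight is read once per generated token, so the decode rate is capped by bandwidth divided by model size. A back-of-envelope sketch with illustrative numbers (not EdgeThought specifications):

```python
def max_decode_tokens_per_s(model_bytes, dram_bw_bytes_per_s, utilization=1.0):
    """Upper bound on single-batch decode rate: each token touches all weights once."""
    return utilization * dram_bw_bytes_per_s / model_bytes

# Hypothetical setup: 7B parameters at int8 (1 byte/weight), 64 GB/s DRAM interface.
rate = max_decode_tokens_per_s(7e9, 64e9, utilization=0.8)
print(round(rate, 1))  # 7.3 tokens/s at 80% bandwidth utilization
```

This is why high bandwidth utilization, rather than raw MAC count, determines response time for LLM inference.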

Programmable and Model-flexible

Minimal yet efficient LLM-specific instruction set supports diverse decoder-only transformers, including LLaMA2, LLaMA3, Mistral, Phi-2, Phi-3, Gemma, etc.


Currently focused on 7-13B models; larger models require more DRAM capacity.
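The DRAM-capacity constraint is easy to quantify: weights need parameters times bytes-per-weight, plus the KV cache grows with context length. A rough sketch using a LLaMA2-7B-like configuration (illustrative, not tied to any specific chip):

```python
def model_dram_gb(params, bytes_per_weight):
    """DRAM needed for model weights alone."""
    return params * bytes_per_weight / 2**30

def kv_cache_gb(layers, heads, head_dim, seq_len, bytes_per_elem=2):
    """KV cache size: K and V tensors per layer, per token (fp16 by default)."""
    return 2 * layers * heads * head_dim * seq_len * bytes_per_elem / 2**30

# LLaMA2-7B-like config: 32 layers, 32 heads x 128 dims, 4k context.
weights = model_dram_gb(7e9, 1)          # int8 weights: ~6.5 GB
cache = kv_cache_gb(32, 32, 128, 4096)   # fp16 KV cache: 2.0 GB
print(round(weights + cache, 1))  # 8.5
```

At int8 a 7B model already needs roughly 8.5 GB of DRAM with a 4k context, which shows why 13B is a practical ceiling on typical edge memory configurations.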

Ecosystem Ready

LLM Frameworks: HuggingFace Transformers, Nvidia Triton Inference Server, OpenAI API, and LangChain API.


Fine-Tuning and RAG Toolkits: HuggingFace PEFT, QLoRA, LlamaIndex, and LangChain.
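Because the stack exposes an OpenAI-compatible API, any standard client can talk to it. A minimal sketch that builds a chat-completions request for such an endpoint; the URL and model name below are placeholders, not documented EdgeThought values:

```python
import json
import urllib.request

def build_chat_request(base_url, model, user_msg):
    """Construct a request for an OpenAI-compatible /v1/chat/completions endpoint."""
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": user_msg}],
        "max_tokens": 128,
    }
    return urllib.request.Request(
        url=f"{base_url}/v1/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )

# Placeholder endpoint for a local accelerator serving a LLaMA3-class model.
req = build_chat_request("http://localhost:8000", "llama3-8b", "Hello")
print(req.full_url)  # http://localhost:8000/v1/chat/completions
# urllib.request.urlopen(req) would send it to a running server.
```

Keeping the wire format OpenAI-compatible means existing tooling such as LangChain points at the edge device by changing only the base URL.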

05_

Products

ONNC

Commercial
Learn More


The ONNC compiler is a bundle of C++ libraries and tools that accelerates your development of compilers for deep learning accelerators (DLAs). It targets diverse SoC architectures, from a simple single-core system to a heterogeneous system with a multi-level memory hierarchy and bus connections.

Forest
Runtime

Commercial
Learn More


Forest Runtime executes compiled neural network models on the hardware platform of your choice. It provides common C++ APIs with C and Python bindings for various AI applications to perform inference. Forest Runtime is retargetable: it has a modular architecture, and we've ported it to diverse hardware platforms, including datacenter, mobile, and TinyML.

Calibrator

Commercial
Learn More


ONNC Calibrator leverages hardware architecture information to keep AI system-on-chips at high precision through post-training quantization (PTQ). The key metric for validating a quantization technique is its precision drop.
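The PTQ idea and its precision-drop metric can be illustrated with a minimal symmetric int8 weight quantizer. This is a generic sketch of the technique, not the ONNC Calibrator's actual algorithm:

```python
import numpy as np

def quantize_int8(w):
    """Symmetric per-tensor int8 quantization: w is approximated by scale * q."""
    scale = np.abs(w).max() / 127.0
    q = np.clip(np.round(w / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    return q.astype(np.float32) * scale

rng = np.random.default_rng(0)
w = rng.standard_normal((64, 64)).astype(np.float32)
q, s = quantize_int8(w)

# Precision-drop proxy: reconstruction error relative to the fp32 weights.
rel_err = np.abs(dequantize(q, s) - w).max() / np.abs(w).max()
print(rel_err < 0.01)  # rounding error is bounded by about scale/2
```

Architecture-aware calibration refines exactly this step, choosing scales per layer or per channel so the measured precision drop stays small on the target hardware.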