#artificial-intelligence #machine-learning #sys

ort-tract

ort + tract = 🦀 - An alternative backend for ort, powered by tract

3 releases (breaking)

Uses new Rust 2024

0.3.0+0.22 Mar 5, 2026
0.2.0+0.22 Jan 7, 2026
0.1.0+0.21 Jun 4, 2025

#15 in #sys

Download history 20/week @ 2026-03-03 702/week @ 2026-03-10 1195/week @ 2026-03-17 833/week @ 2026-03-24 960/week @ 2026-03-31 842/week @ 2026-04-07 623/week @ 2026-04-14 1169/week @ 2026-04-21 958/week @ 2026-04-28 1769/week @ 2026-05-05 3109/week @ 2026-05-12 1819/week @ 2026-05-19 1353/week @ 2026-05-26 1982/week @ 2026-06-02

8,844 downloads per month

MIT/Apache

32KB
727 lines

Coverage Results MSRV ONNX Runtime


ort is a Rust interface for performing hardware-accelerated inference & training on machine learning models in the Open Neural Network Exchange (ONNX) format.

Based on the now-inactive onnxruntime-rs crate, ort is primarily a wrapper for Microsoft's ONNX Runtime library, but offers support for other pure-Rust runtimes.

ort with ONNX Runtime is super quick - and it supports almost any hardware accelerator you can think of. Even still, it's light enough to run on your users' devices.

When you need to deploy a PyTorch/TensorFlow/Keras/scikit-learn/PaddlePaddle model either on-device or in the datacenter, ort has you covered.

📖 Documentation

🤔 Support

🌠 Backers

💖 FOSS projects using ort

Open a PR to add your project here 🌟

  • Text Embeddings Inference (TEI) uses ort to deliver high-performance ONNX Runtime inference for text embedding models.
  • Magika uses ort for neural network-based file type detection.
  • retto uses ort for reliable, fast ONNX inference of PaddleOCR models on Desktop and WASM platforms.
  • edge-transformers uses ort for accelerated transformer model inference at the edge.
  • sbv2-api is a fast implementation of Style-BERT-VITS2 text-to-speech using ort.
  • BoquilaHUB uses ort for local AI deployment in biodiversity conservation efforts.
  • CamTrap Detector uses ort to detect animals, humans and vehicles in trail camera imagery.
  • Ortex uses ort for safe ONNX Runtime bindings in Elixir.
  • oar-ocr A comprehensive OCR library, built in Rust with ort for efficient inference.
  • FastEmbed-rs uses ort for generating vector embeddings, reranking locally.
  • Ahnlich uses ort to power their AI proxy for semantic search applications.
  • Murmure uses ort as its core engine, leveraging NVIDIA Parakeet to deliver fully local, free, private and cross‑platform Speech‑to‑Text enhanced with LLM post‑processing.
  • Valentinus uses ort to provide embedding model inference inside LMDB.
  • SilentKeys uses ort for fast, on-device real-time dictation with NVIDIA Parakeet and Silero VAD.
  • Xybrid uses ort to run LLMs, ASR, and TTS natively on-device across iOS, Android, Flutter, and Unity apps and games.
  • Ultralytics YOLO Rust Inference is a high-performance, pure Rust library and CLI providing fast and efficient interface for running YOLO models using ort.

Dependencies

~13–20MB
~401K SLoC