ONNX Runtime: Unleash On-Device AI
Tl;dr ONNX Runtime is your cross-platform, hardware-accelerated inference engine to deploy fast and private AI anywhere. What even is ONNX? ONNX Runtime (pronounced like “Onix”, the rock-snake Pokémon) is a performance-focused scoring and inference engine for models in the Open Neural Network Exchange (ONNX) format, designed to support heavy workloads and deliver fast, reliable predictions in production scenarios. Developed by Microsoft, it provides APIs in Python, C++, C#, Java, and JavaScript, and runs on Windows, Linux, and macOS. Its core is implemented in C++ for speed, and it seamlessly interoperates with ONNX-exported models from PyTorch, TensorFlow, Keras, scikit-learn, LightGBM, XGBoost, and more. ...
Infinite Iterations
I tend to keep making new blog websites, updating the tech stack ‘cause I find something better, and just keep updating the repo. Yet, in all iterations, I always have a stock homepage which says something on the lines of ‘Welcome to my site, I’m still busy building this blog. Thank you for your patience’. The idea of starting a blog gives you the feeling of being productive and gets you going, yet I never happen to start ’the writing’. Funnily, this is the first post I’m writing here, brain dumping my problem of actually writing posts. ...