Building an Async Rust Runtime on io_uring: 7.5ms vs Tokio's 14.9ms

The Question That Started Everything

Why Does Async Exist at All?

Enter io_uring: The Kernel's Secret Weapon

How RingCore Works: A Tour of the Four Layers

Layer 1: Talking to the Kernel (src/sys.rs, src/ring.rs)

Layer 2: Wrapping Operations in Futures (src/op.rs)

Layer 3: The Executor (src/executor.rs)

Layer 4: Friendly Wrappers (src/net.rs)

The Benchmarks

File I/O: reading a 100MB file

Networking: sequential and concurrent requests

Advanced: kernel-level task chaining

The Mental Model That Changes Everything

What's in the Repo

Requirements

Why Build This Instead of Just Using Tokio?

This is Part of a Series

You use async/await every day. But do you know what actually happens when your code "pauses"? I didn't, so I built something to find out. The result is RingCore, a minimal async runtime in Rust, built directly on Linux's io_uring, with zero abstraction layers in the way. No Tokio. No hidden thread pools. Just Rust, a kernel interface, and a lot of curiosity.

If you've written async Rust, you've probably typed this: `let data = file.read().await;`. And it just works. The program doesn't freeze. Other tasks keep running. But I kept asking: what is actually happening when .await suspends a task? Where does execution go? Who wakes it back up? How does the OS fit into any of this? Most tutorials stop at "the runtime handles it." That answer never satisfied me.

Imagine you're a chef in a kitchen. You put a steak on the grill and just stand there watching it cook. You don't prep the salad. You don't plate the dessert. You just wait. That's synchronous I/O: your program calls read(), the OS fetches data from disk or the network, and your thread sits idle until it comes back. Wasteful.

Async I/O lets you be a smarter chef. You start the steak, set a timer, and go do other things. When the timer fires, you come back and finish. In Rust, async/await is the language-level mechanism for writing this kind of code. But Rust itself doesn't define how the waiting works; that's the runtime's job. Most people reach for Tokio, which is fantastic and production-ready. But it's also a black box. I wanted the white box.

Traditional async I/O on Linux is expensive. Every interaction with the kernel requires a context switch: a CPU jump from user mode (your program) into kernel mode (the OS) and back. Under heavy I/O load, these add up fast. io_uring, introduced in Linux 5.1 by kernel developer Jens Axboe, takes a radically different approach.
Instead of making individual system calls, your program and the kernel share two ring buffers in memory: a Submission Queue (SQ) where you write I/O requests, and a Completion Queue (CQ) where the kernel writes results back. Think of it like a diner counter with a ticket window. Instead of running to the kitchen for every order, you slide all your tickets through the window at once and the kitchen slides the finished plates back. One trip. Maximum efficiency. Multiple I/O operations can be batched into a single io_uring_enter system call. Context switches plummet. Performance soars.

The lowest layer handles raw kernel communication. No OS library wrappers. No abstraction. RingCore manually invokes SYS_IO_URING_SETUP and SYS_IO_URING_ENTER via libc, and uses mmap to map the kernel's SQ and CQ ring buffers directly into the process's address space. This is the part most async tutorials skip entirely. In RingCore, it's front and center.

This is where things get interesting. Rust's Future trait is simple: poll it, and you get Poll::Ready(value) if the result is done, or Poll::Pending if not, along with a Waker so someone can nudge the task later. In RingCore, every io_uring operation becomes a Future; the key poll implementation is shown in the code listings below. The elegant part: when the kernel finishes and writes a CQE with a matching ID, the executor retrieves the stored Waker and calls it. No magic; it's just a map, an ID, and a callback.

The executor is the brain that orchestrates everything. Its main loop is beautifully simple: a classic event loop, similar in spirit to Node.js, but with direct kernel access instead of libuv underneath.

The top layer gives you TcpListener and TcpStream with clean async fn methods. They feel like normal Rust networking, but under the hood they're submitting SQEs to the ring. The whole stack: four files, clean separation, nothing hidden.

Tested on Debian 13, kernel 6.12, comparing RingCore against std and Tokio.

Tokio is 5× slower here. Why? Tokio doesn't use io_uring for file I/O by default; it offloads blocking file reads to a thread pool, which adds significant overhead.
RingCore uses true async kernel operations.

The 1,000-request stress test is the eye-opener. Tokio takes over a second because its multi-threaded scheduling model pays heavy coordination costs at this scale. RingCore handles all of it on a single thread, with the kernel doing the heavy lifting.

Using IOSQE_IO_LINK, RingCore chains dependent operations (like Read → Write) so the kernel executes them back-to-back without ever returning to userspace. One io_uring_enter call. Zero ping-pong.

Here's what building RingCore made concrete for me, the thing no tutorial made clear before. When you .await something in Rust, you're saying: "I'm not ready yet. Here's my callback (the Waker). Come get me when something changes." The executor moves on to other tasks. The kernel works in the background. When the kernel is done, it writes a CQE. The executor reads it, finds the matching Waker in the map, and calls it. Your task wakes up and continues from where it left off. That's the entire model. RingCore makes every step of it visible, and there's no layer you can't read.

Examples are organized into four tiers so you can explore progressively:

- Tier 1: proving the runtime
- Tier 2: the async model
- Tier 3: real workloads
- Tier 4: advanced features

Start with echo, trace through the source, and you'll have a complete mental model of async I/O in about an afternoon.

Tokio is the right choice for production. I'm not suggesting you replace it. But if you've ever stared at a select! macro, a JoinHandle, or a .await and wondered what is actually happening in the kernel right now, building something like RingCore is the answer. I'm not intimidated by async Rust anymore. Not because it got simpler, but because I can now see every moving part. The abstraction didn't disappear; I just understand what it's abstracting.

RingCore isn't the first time I've gone down this rabbit hole.
A few weeks ago I also built a container engine in Rust that starts in 10ms, cracking open Linux namespaces, cgroups, and clone() syscalls along the way. The two projects rhyme. With the container engine I asked: what actually happens when you run a container? With RingCore I asked: what actually happens when you .await? Both answers live in the kernel. Both are learnable. And the best way to demystify them is to build a tiny, intentionally incomplete version yourself.

If this sparked any curiosity about systems programming, async I/O, or Rust internals, that was the whole point. Issues and PRs are very welcome.


The async call from the intro:

```rust
let data = file.read().await;
```

Layer 1: manual syscalls and ring mapping (src/sys.rs, src/ring.rs):

```rust
// Manually invoke the io_uring_setup syscall
let ring_fd = unsafe {
    libc::syscall(
        libc::SYS_io_uring_setup,
        QUEUE_DEPTH as libc::c_long,
        &params as *const _ as libc::c_long,
    )
} as i32;

// Map the Submission Queue into our address space
let sq_ptr = unsafe {
    libc::mmap(
        std::ptr::null_mut(),
        sq_size,
        libc::PROT_READ | libc::PROT_WRITE,
        libc::MAP_SHARED | libc::MAP_POPULATE,
        ring_fd,
        libc::IORING_OFF_SQ_RING as libc::off_t,
    )
};
```

Layer 2: the Future implementation for an operation (src/op.rs):

```rust
impl Future for Op {
    type Output = i32;

    fn poll(mut self: Pin<&mut Self>, cx: &mut Context<'_>) -> Poll<Self::Output> {
        // If this is the first poll, submit the SQE to the ring
        if !self.submitted {
            RING.with(|ring| {
                let mut ring = ring.borrow_mut();
                // Write the Submission Queue Entry to the shared kernel buffer
                ring.push_sqe(self.sqe);
            });
            // Store the Waker in a global map, keyed by our unique operation ID.
            // The executor will retrieve this when the kernel signals completion.
            WAKER_MAP.with(|map| {
                map.borrow_mut().insert(self.user_data, cx.waker().clone());
            });
            self.submitted = true;
            return Poll::Pending; // Go away; we'll call you when the kernel is done
        }

        // Check if our Completion Queue Entry has arrived
        match self.result.take() {
            Some(res) => Poll::Ready(res),
            None => {
                // Update the waker and keep waiting
                WAKER_MAP.with(|map| {
                    map.borrow_mut().insert(self.user_data, cx.waker().clone());
                });
                Poll::Pending
            }
        }
    }
}
```

Layer 3: the executor's main loop (src/executor.rs):

```rust
pub fn run(&mut self) {
    loop {
        // Step 1: Poll all tasks that have been woken up
        while let Some(task) = self.ready_queue.pop_front() {
            let waker = task.waker();
            let mut cx = Context::from_waker(&waker);
            match task.future.borrow_mut().as_mut().poll(&mut cx) {
                Poll::Ready(_) => { /* Task complete, drop it */ }
                Poll::Pending => { /* Task is waiting on I/O, leave it */ }
            }
        }

        // Step 2: Submit pending SQEs and harvest completed CQEs.
        // min_complete=1 means: block until at least one operation finishes.
        // This puts the thread to sleep until the kernel has work for us.
        let completed = self.ring.submit_and_wait(1);

        // Step 3: For each completed operation, wake the waiting task
        for cqe in completed {
            WAKER_MAP.with(|map| {
                if let Some(waker) = map.borrow_mut().remove(&cqe.user_data) {
                    // Store the result, then wake the future
                    store_result(cqe.user_data, cqe.res);
                    waker.wake();
                }
            });
        }

        if self.all_tasks_complete() {
            break;
        }
    }
}
```

Layer 4: the friendly TcpStream wrapper (src/net.rs):

```rust
impl TcpStream {
    pub async fn read(&self, buf: &mut [u8]) -> io::Result<usize> {
        // This creates an Op future that submits IORING_OP_READ
        // and suspends until the kernel completes it
        let result = Op::read(self.fd, buf).await;
        if result < 0 {
            Err(io::Error::from_raw_os_error(-result))
        } else {
            Ok(result as usize)
        }
    }
}
```

Tier 1: proving the runtime:

```sh
cargo run --example echo           # Chained Accept → Read → Write
cargo run --example cat -- <file>  # File I/O in isolation
cargo run --example timer          # Task parking and waking without I/O
```

Tier 2: the async model:

```sh
cargo run --example concurrent_downloads  # 100 SQEs submitted simultaneously
cargo run --example timeout_race          # Operation cancellation via IORING_OP_ASYNC_CANCEL
```

Tier 3: real workloads:

```sh
cargo run --example http_server  # High-concurrency "Hello World"
cargo run --example file_server  # Serving static files over TCP
```

Tier 4: advanced features:

```sh
sudo cargo run --example sqpoll           # Kernel-side SQ polling (needs CAP_SYS_ADMIN)
cargo run --example linked_cat -- <file>  # Chained Read + Write at kernel level
cargo run --example multishot_accept      # One SQE → infinite connection CQEs
```

Adding RingCore to a project:

```toml
[dependencies]
ringcore = "0.1.0"
```

The two ring buffers:

- Submission Queue (SQ): you write your I/O requests here.
- Completion Queue (CQ): the kernel writes results back here.

Requirements:

- Linux 5.10+ for stable IORING_OP_ACCEPT support
- x86_64 architecture
- Dependencies: libc and std only

Links:

- GitHub: github.com/sumant1122/ringcore
- Crates.io: crates.io/crates/ringcore
- Docs: docs.rs/ringcore
- Companion project (container engine): github.com/sumant1122/Nucleus