Show HN: Llama 3.1 70B on a single RTX 3090 via NVMe-to-GPU bypassing the CPU

High-efficiency C++/CUDA LLM inference engine. Runs Llama 70B on a single RTX 3090 (24GB VRAM) by streaming model layers through GPU memory via PCIe, with optional NVMe direct I/O that bypasses the CPU entirely. Key Results Model Mode Decode VRAM Notes Llama 3.1 8B Q8_0 Resident 48.9 tok/s 10.0 GB All layers in VRAM Llama […]
EDuke32 – Duke Nukem 3D (Open-Source)

Per-pixel dynamic lighting and realtime shadows… groovy! Polymer renderer requires a bad-ass video card. More Polymer greatness. Hollywood Holocaust with classic textures Come get some! EDuke32 is an awesome, free homebrew game engine and source port of the classic PC first person shooter Duke Nukem 3D— Duke3D for short—to Windows, Linux, macOS, FreeBSD, several handhelds, […]
Inputlag.science – Repository of knowledge about input lag in gaming

Home |inputlag.science inputlag.science Welcome Hello traveler, welcome to the repository of knowledge about input lag in gaming. The input lag in a gaming system, or any interactive system, is the latency between the user input and a reaction on the screen. Input lag is an issue that has crept in the industry, little by little, […]
Parse, Don’t Validate and Type-Driven Design in Rust

Reading time: 17 min read Table of Contents 1.1 Dividing by zero 1.2 Examples in the wild 1.3 Maxims of Type Driven Design 1.4 What can we do? 1.5 Conclusion Photo by the Tingley Injury Law Firm. In the Rust Programming Language Community Server, there’s tag named -parse-dont-validate which links to an article about the […]
How Taalas ”prints” LLM onto a chip?

or how to generate 17000 tokens per second? February 22, 2026 · 4 min read A startup called Taalas, recently released an ASIC chip running Llama 3.1 8B (3/6 bit quant) at an inference rate of 17,000 tokens per seconds. That’s like writing around 30 A4 sized pages in one second. They claim it’s 10x […]
Cloudflare outage on February 20, 2026

On February 20, 2026, at 17:48 UTC, Cloudflare experienced a service outage when a subset of customers who use Cloudflare’s Bring Your Own IP (BYOIP) service saw their routes to the Internet withdrawn via Border Gateway Protocol (BGP). The issue was not caused, directly or indirectly, by a cyberattack or malicious activity of any kind. […]
Canvas_ity: A tiny, single-header -like 2D rasterizer for C++

Example const context = document.getElementById( ”example” ).getContext( ”2d” ); // Build a star path. context.moveTo( 128.0, 28.0 ); context.lineTo( 157.0, 87.0 ); context.lineTo( 223.0, 97.0 ); context.lineTo( 175.0, 143.0 ); context.lineTo( 186.0, 208.0 ); context.lineTo( 128.0, 178.0 ); context.lineTo( 69.0, 208.0 ); context.lineTo( 80.0, 143.0 ); context.lineTo( 32.0, 97.0 ); context.lineTo( 98.0, 87.0 ); context.closePath(); […]
Personal Statement of a CIA Analyst

4 October 2018 CIA Applicant Screening I first took a polygraph when I applied to the CIA and went through the applicant screening process. To prepare for the test, I read A Tremor in the Blood by David T. Lykken. The book described the use of control versus relevant questions as well as countermeasures such […]
What Not to Write on Your Security Clearance Form
What Not To Write On Your Security Clearance Form as reported in the [REDACTED] list and RISKS Date: 01 Apr 88 1620 PSTFrom: Les Earnest LES…@S…Subject: The ”previous account” referred to in RISKS-6.51 e-t-a-o-n-r-i Spy and the FBI Reading a book got me into early trouble–I had an FBI record by age twelve. This bizarre […]
The Nekonomicon – Nekochan.net Archive, Updated
The Nekonomicon Volume 1 – Book of Endings What is the Nekonomicon? Volume 2 – Book of Notes The collected works of the Nekochan forums. Volume 3 – Book of Illustrations Nekochan photo gallery.