Grosan Flaviu Gheorghe

How to Limit GPU Power and Clock Speeds for AI Inference with a Dockerised Controller

Running multiple GPUs for local AI inference at stock power settings is wasteful. Consumer cards like the RTX 3090 draw 350 W by default, but AI inference workloads are typically memory-bandwidth bound, not compute bound. The GPU cores spend much of their time waiting for data, burning power for no performance

How to cool passive NVIDIA GPUs (Tesla V100, P40) with a Dockerised Fan Controller

The NVIDIA Tesla V100 and Tesla P40 are passively cooled cards, designed for data centre chassis with high-volume front-to-back airflow. Used in a desktop workstation or a home server, they will thermal throttle within minutes because they have no onboard fans. This article describes a Dockerised solution that reads GPU

How to install drivers for NVIDIA Tesla V100 on Fedora 44 Server Edition for AI Inference

The NVIDIA Tesla V100 has become a surprisingly attractive GPU for local LLM inference, now that its end-of-life status has flooded the market with cheap used cards. This article describes how to install it on Fedora 44, and the exact steps to get it working with Docker and

Recompile Chromium with JavaScript, Cookies, Notifications and Profiles disabled, with Claude's help

I recompiled Chromium with JavaScript, cookies, notifications, and profiles disabled - no tracking, no banners, no nonsense. Here's how.

How to run SketchUp 2025 on Linux using Steam and GE Proton

How to run SketchUp 2025 on Linux by copying a Windows installation into Steam with GE-Proton. Early draft with working steps.

Grosan Flaviu Gheorghe © 2026