zabirauf || Zohaib

Link · May 27, 2026 · permalink

Crumb - a small self-improving agent harness in under 200 lines of hand-written TypeScript. Bun runtime, no external libraries.

The goal wasn't to build something useful. It was to take the magic out of "self-improving agent harness" by writing the smallest readable version of one.

When it decides to improve itself it edits its own code. The runtime reloads the new code into a fresh process and hands the conversation state back - so the next turn is the upgraded agent still talking to you with full memory.

It only has 5 tools which are get_user_input, call_shell, clear_conversation, restart and exit. call_shell does most of the heavy lifting and makes it into a capable agent.

Note · May 4, 2026 · permalink

I feel there is spectrum between vibe coding and agentic engineering, which I like to refer to as Outcome Driven Development (ODD). Agentic engineering is using agents but evaluating its outputs with the same thoroughness as if you wrote code by hand. ODD is where you care less about the generated code but more about the outcomes it produces and do a thorough testing on those outcomes.

Note · Apr 27, 2026 · permalink

I’ve been primarily using Claude Code. The recent launch of DeepSeek V4 and GPT 5.5, has made me realize the importance of having an open coding harness that is model agnostic. That will help me explore the capabilities of these new models. I’ve just reinstalled OpenCode and gonna give that a try again.

Link · Apr 21, 2026 · permalink

Lightening of a room has a lot of effect on how you feel about it

Good article describing CRI (Color Rendering Index). Living in the Pacific Northwest, past winters I've been thinking of how to improve SAD by using better lighting. This article provided me more knowledge on what should I be looking for.

Link · Apr 18, 2026 · permalink

Apps and programming: two accidental tyrannies

Coding agents have a tendency to make all experiences be alike. Using them to push UX is more of something we should do as experimentation is now a lot cheaper. I love the idea of apps being much more extensible and customizable by the users.

Note · Apr 18, 2026 · permalink

I've been thinking about AI-generated tests and the friction around them. AI can generate a lot of tests, but figuring out which ones are "good" requires human review that doesn't scale. The reframe I keep coming back to is to stop judging them on quality and start thinking of them as tripwires. Their job isn't to validate correctness, it's to alert reviewers when something significant changed. If a PR trips them, look closely. If nothing trips, the change likely didn't touch core logic.

If that framing holds, you could use techniques like mutation testing to find gaps where code changes would go unnoticed, then generate tests targeting those gaps at scale. The bar shifts from "is this a good test" to "would this catch a change," which is much easier to clear with less review overhead and directly accelerates code reviews by telling reviewers where to focus. Still thinking through this but it feels like an interesting direction to explore.

Feb 13, 2025

How to safely connect cloud server to home GPU server

Unlock home GPU power for cloud workloads with Tailscale and Docker. I demo a simple architecture that tunnels requests to your local rig, skipping pricey cloud GPUs. All ports stay hidden, ensuring a tight, cost-effective environment.

digital ocean docker how-to homelab · 5 min read min

Feb 8, 2025

DeepSeek-R1: A Peek Under the Hood

DeepSeek-R1 uses cost-effective Reinforcement Learning to unlock emergent reasoning. By rewarding correct, verifiable steps, it refines logic and answers—showcasing how systematic feedback can reduce data needs and boost performance. Here I discuss my understanding from research paper.

LLM-Training LLM · 6 min read min

Jan 1, 2025

Using LLMs and Cursor to become a finisher

Struggling to finish side projects due to limited time? In my latest blog post, I share how I improved my productivity using AI tools like LLMs and Cursor IDE. Learn to refine specs, bootstrap code, and iterate effectively to rapidly build and deploy your projects—even with a busy schedule.

LLM development · 8 min read min

Oct 27, 2024

Managing Secrets in NixOS Home Manager with SOPS

Discover how to securely manage secrets in NixOS Home Manager using SOPS and sops-nix. Learn to set up Age encryption, create encrypted secret files, integrate SOPS with NixOS, and access secrets as environment variables. Perfect for maintaining secure, declarative NixOS configurations.

nixos how-to · 4 min read min

Sep 6, 2024

Run custom GGUF model on Ollama

Learn how to install a custom Hugging Face GGUF model using Ollama, enabling you to try out the latest LLM models locally. This guide covers downloading the model, creating a Modelfile, and setting up the model in Ollama and Open-WebUI.

ollama how-to LLM open-webui homelab · 2 min read min

Jul 31, 2024

Guide to extending OpenWebUI using Pipelines

Explore OpenWebUI's Pipelines: extend your self-hosted LLM interface. Learn to create custom pipelines, from filters to tools. Intercept LLM interactions, implement function-calling, and integrate new providers. Enhance AI workflows or build RAG systems with this guide to OpenWebUI's extensibility.

LLM development open-webui · 7 min read min

Feb 15, 2024

A beginners guide to fine tuning LLM using LoRA

Discover how to create a synthetic dataset, select the right metrics for evaluation, and fine-tune your model using LoRA for a narrow scenario. Plus, learn how to serve your model efficiently using LLaMa.cpp on Mac/Linux.

how-to LLM LLM-Training · 8 min read min

Sep 30, 2023

A guide to using Python with Supabase securely

Learn how to use Python with Supabase for web development for all your AI/LLM backend needs. Create a secure Python service that validates JWT tokens from Supabase and integrates with Next.js.

how-to service react reactjs python · 6 min read min

Jan 28, 2022

Suggestions for onboarding remote Software Engineers

Onboarding remote Software Engineers can be challenging. Here are some suggestions from my experience of onboarding multiple engineers remotely to make them become part of team.

management team · 5 min read min

Apr 21, 2021

Why I created a vaccine finder

I recently built findvaccinefor.me [https://findvaccinefor.me] which gets data from WA state sources such as vaccinelocator.doh.wa.gov [http://vaccinelocator.doh.wa.gov] and prepmod.doh.wa.

· 4 min read min

Jan 18, 2021

Debugging in Google Colab notebook

Introduction on debugging in Python Jupyter notebooks with examples in Google Colab using ipdb.

python jupyter development · 4 min read min

Jun 9, 2020

Fence your TypeScript, for saner project boundaries

Solve the problem of dependency creep and add boundaries to your TypeScript project to limit what is exported and imported from a package.

typescript gulp · 4 min read min

Aug 7, 2019

Using path aliases in React Native created with Expo and Typescript

Managing big apps have its own problems and one of them is managing imports. Typescript allows you to have path aliases which makes it easier to refer to modules. In this tutorial we will see how we can use path aliases in React Native app created using Expo and Typescript.

typescript react-native babel development video-mini-tutorials · 2 min read min

Sep 11, 2018

Securing MongoDB using Let's Encrypt certificate

I had to setup few instances of MongoDB in DigitalOcean [https://m.do.co/c/1fe978d3b5b4] but I also wanted to make them secure using TLS/SSL. One way would

nginx mongodb digital ocean ubuntu · 5 min read min

Apr 18, 2018

Upgradeable smart contracts in Ethereum

Imagine a world where a software is maintaining millions of dollars worth of money, but there is an exploit which allows the hacker to take all that money away. Now

ethereum solidity blockchain · 5 min read min

Jan 11, 2018

Libraries in Solidity for code reusability and testing it

What are Libraries in Solidity You might have heard of the DRY principle (don't repeat yourself). It is very essential in large programs to have the ability to

ethereum solidity blockchain · 6 min read min

Jul 3, 2017

Leverage union types in Typescript to avoid invalid states

Typescript has a type system which provides more capabilities than the type system in C# and Java. The one that I found to be very useful is the union type

typescript · 2 min read min

May 12, 2017

Creating object based on interface type in Typescript

Interface in Typescript is used to tell the compiler what the shape of the JS object should look like. It is a compile time construct hence it will not have

typescript javascript · 1 min read min

Feb 25, 2017

Setting up continuous integration for F# Suave app to Azure

Whenever I start a project I tend to have a continuous build and deployment pipeline setup as it makes the development and testing a lot more easier. In this tutorial

service development fsharp deployment ci · 4 min read min