<?xml version="1.0" encoding="UTF-8"?><?xml-stylesheet href="/rss.xsl" type="text/xsl"?><rss version="2.0" xmlns:content="http://purl.org/rss/1.0/modules/content/"><channel><title>Vanishing Gradients</title><description>Henrik Albihn on production AI, agentic infrastructure, and the systems that make coding agents reliable. Thesis, receipts, and notes from the forward-deployed trenches — Fortune 500, Big Four, and the infrastructure layer underneath.</description><link>https://vanishinggradients.xyz</link><item><title>introduction</title><link>https://vanishinggradients.xyz/posts/introduction</link><guid isPermaLink="true">https://vanishinggradients.xyz/posts/introduction</guid><description>the journey begins</description><pubDate>Thu, 01 Jan 2026 12:00:00 GMT</pubDate><content:encoded>&lt;p&gt;I write here about the strange loop we&apos;re in: humans building minds; minds changing humans. Less futurism, more practice—what holds up when you ship.&lt;/p&gt;
&lt;h2&gt;The Strange Loop&lt;/h2&gt;
&lt;p&gt;Something has shifted. We are building systems that can surprise their creators—not only by failing in new ways, but by producing useful lines of thought we didn&apos;t see coming. The tools are starting to feel like collaborators.&lt;/p&gt;
&lt;p&gt;It shows up in mundane places: refactors, incident reviews, product decisions, code written faster than you can fully justify. The future arrived quietly.&lt;/p&gt;
&lt;p&gt;What I keep coming back to isn&apos;t the technology in isolation, but the feedback. We build AI, AI changes how we work, work changes what we build, and the cycle tightens. I want to understand the shape of that loop—and the cost of getting it wrong.&lt;/p&gt;
&lt;h2&gt;The Winding Path&lt;/h2&gt;
&lt;p&gt;My path looks discontinuous on paper: economics, bartending through grad school, then building AI systems.&lt;/p&gt;
&lt;p&gt;Economics taught me to look for incentives and for the gap between a model and the world it claims to describe. Bartending taught me that people rarely say what they mean, but they always communicate what they need. Graduate school taught me that expertise is knowing which corners you&apos;re cutting, and why.&lt;/p&gt;
&lt;p&gt;Reading people and reading data are more similar than I expected. Both punish overconfidence. Both reward attention to what&apos;s happening instead of what should be happening.&lt;/p&gt;
&lt;p&gt;The strangest career advice I ever received: &quot;Your job is to be confused in the right direction.&quot; I think about that constantly.&lt;/p&gt;
&lt;h2&gt;Philosophy of Work&lt;/h2&gt;
&lt;p&gt;I believe in doing less, better. Shipping early. Building systems that fail predictably and recover gracefully. Staying close to real problems instead of their filtered versions.&lt;/p&gt;
&lt;p&gt;There&apos;s a particular kind of productivity that&apos;s actually procrastination dressed up in busy clothes. Meetings about meetings. Documentation no one reads. Processes designed to distribute blame rather than create value. I&apos;ve learned to recognize it by the way it makes me feel productive without producing anything.&lt;/p&gt;
&lt;p&gt;The opposite is also true: the most impactful work often looks like nothing. A small change that prevents a future incident. A conversation that removes a misunderstanding. A constraint that makes the wrong thing hard to do. It compounds quietly.&lt;/p&gt;
&lt;h2&gt;The Quiet Revolution&lt;/h2&gt;
&lt;p&gt;The best AI isn&apos;t a demo. It&apos;s quiet infrastructure that keeps working.&lt;/p&gt;
&lt;p&gt;I&apos;m suspicious of technology that needs to announce itself. The most transformative tools disappear into the workflow. You stop noticing them the same way you stop noticing electricity. They become assumed, ambient, load-bearing.&lt;/p&gt;
&lt;p&gt;That&apos;s what I&apos;m trying to build: systems that earn the right to be invisible. Not because they&apos;re trivial, but because they&apos;re reliable enough that attention can go elsewhere. The goal isn&apos;t to impress engineers with cleverness—it&apos;s to return human attention to problems that require it.&lt;/p&gt;
&lt;h2&gt;What You&apos;ll Find Here&lt;/h2&gt;
&lt;p&gt;Some posts will be technical—gradient descent, attention, the geometry of embeddings. Some will be philosophical—what it means to build minds, how tools change work, the ethics of automation. Some will be small observations that don&apos;t fit elsewhere.&lt;/p&gt;
&lt;p&gt;I write to understand. If it helps you too, good.&lt;/p&gt;
&lt;hr /&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href=&quot;/posts&quot;&gt;writing&lt;/a&gt;&lt;/strong&gt; — essays and notes&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href=&quot;https://github.com/henrikalbihn&quot;&gt;github&lt;/a&gt;&lt;/strong&gt; — open source work&lt;/li&gt;
&lt;/ul&gt;
</content:encoded><author>Henrik Albihn</author></item><item><title>operational definitions</title><link>https://vanishinggradients.xyz/posts/operational-definitions</link><guid isPermaLink="true">https://vanishinggradients.xyz/posts/operational-definitions</guid><description>the lines we draw, the boundaries we set</description><pubDate>Sun, 01 Feb 2026 12:00:00 GMT</pubDate><content:encoded>&lt;p&gt;:::note[Vanishing Gradient (noun)]
&lt;strong&gt;Definition:&lt;/strong&gt; In deep neural networks, the reduction of error signals as they propagate backward through layers, making it difficult for early (shallow) layers to learn. It also describes the tendency for transformative insights to lose strength as they travel back through our own beliefs, rarely reaching and reshaping our foundational assumptions.
:::&lt;/p&gt;
&lt;h2&gt;The Architecture of Learning&lt;/h2&gt;
&lt;p&gt;A neural network learns by listening to its mistakes.&lt;/p&gt;
&lt;p&gt;You show it something, it guesses, it&apos;s wrong. That wrongness becomes a signal—a whisper that travels backward through the architecture, asking each layer: &lt;em&gt;how did you contribute to this error? How should you change?&lt;/em&gt;&lt;/p&gt;
&lt;p&gt;This is backpropagation: the algorithm that made deep learning possible. It&apos;s elegant in concept—trace responsibility backward, assign credit where credit is due, adjust accordingly. The network improves because the signal carries information about exactly where things went wrong.&lt;/p&gt;
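&lt;p&gt;The mechanics fit in a few lines. Here is a minimal sketch, not a real training loop: a three-layer chain of single sigmoid neurons with weights chosen purely for illustration, where the error signal is pushed backward one layer at a time.&lt;/p&gt;

```python
# A minimal sketch of backpropagation through a tiny chain of layers.
# Each "layer" is a single sigmoid neuron; the weights are hypothetical,
# chosen for illustration only.
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

# Forward pass: each layer multiplies by its weight, then applies sigmoid.
weights = [0.5, 0.8, 1.2]
activations = [1.0]  # the input
for w in weights:
    activations.append(sigmoid(w * activations[-1]))

# Backward pass: trace responsibility layer by layer.
target = 0.0
grad = activations[-1] - target  # error signal at the output
layer_grads = []
for i in reversed(range(len(weights))):
    a = activations[i + 1]
    local = a * (1.0 - a)                      # sigmoid derivative
    grad = grad * local                        # chain rule through the activation
    layer_grads.append(grad * activations[i])  # gradient w.r.t. weight i
    grad = grad * weights[i]                   # propagate back through the weight
layer_grads.reverse()

for i, g in enumerate(layer_grads):
    print(f"layer {i}: dL/dw = {g:.5f}")
```

&lt;p&gt;Run it and the per-weight gradients shrink as the signal travels toward the earliest layer, which is exactly the decay described next.&lt;/p&gt;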
&lt;p&gt;But signals decay over distance.&lt;/p&gt;
&lt;h2&gt;The Fading Signal&lt;/h2&gt;
&lt;p&gt;By the time the whisper reaches the earliest layers—the ones forming the most primitive representations, the foundations upon which everything else is built—it has faded to silence. The gradient has vanished. The deep layers learn. The shallow layers remain strangers to the truth that passed through them.&lt;/p&gt;
&lt;p&gt;This isn&apos;t a bug in the algorithm. It&apos;s a mathematical inevitability. Each layer transforms the signal through an activation function—a nonlinearity that gives networks their power but also attenuates gradients. Multiply enough small numbers together and you get effectively zero.&lt;/p&gt;
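&lt;p&gt;The arithmetic is worth seeing once. The sigmoid&apos;s derivative never exceeds 0.25, so even before the weights are taken into account, the backward signal through n sigmoid layers is scaled by at most 0.25 to the power n. A few illustrative depths:&lt;/p&gt;

```python
# The sigmoid derivative s(x) * (1 - s(x)) peaks at 0.25, so each sigmoid
# layer scales the backward signal by at most 0.25 (ignoring the weights).
# The depths below are illustrative only.
MAX_SIGMOID_GRAD = 0.25

for depth in (5, 20, 50):
    bound = MAX_SIGMOID_GRAD ** depth
    print(f"depth {depth:2d}: gradient magnitude bound {bound:.1e}")
```

&lt;p&gt;By twenty layers the bound is already about a trillionth of the original signal. The foundations hear nothing.&lt;/p&gt;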
&lt;p&gt;For years, this limited how deep we could build. Networks with too many layers simply refused to learn properly. The foundations couldn&apos;t hear the feedback.&lt;/p&gt;
&lt;h2&gt;Why This Name&lt;/h2&gt;
&lt;p&gt;I named this blog after the phenomenon because I keep noticing it everywhere.&lt;/p&gt;
&lt;p&gt;The insight arrives. It&apos;s vivid, urgent. You read something that rearranges your understanding. You have a conversation that shifts your perspective. You make a mistake that should teach you everything.&lt;/p&gt;
&lt;p&gt;But by the time it propagates back to your foundations—to the premises you forgot you chose, the assumptions that calcified into identity—it&apos;s too faint to disturb anything. You update your conclusions while your origins stay frozen in place.&lt;/p&gt;
&lt;p&gt;We mistake this for growth.&lt;/p&gt;
&lt;h2&gt;The Personal Vanishing Gradient&lt;/h2&gt;
&lt;p&gt;Consider how this works in practice. You discover that a belief you held was wrong. Maybe markets aren&apos;t efficient in the way you assumed. Maybe that friend who annoyed you was actually right. Maybe the career advice you&apos;ve been giving was projection all along.&lt;/p&gt;
&lt;p&gt;The correction enters at the surface level. You adjust the specific belief. But the deeper assumptions that generated the belief—your model of how markets work, your theory of what makes someone trustworthy, your understanding of what success means—those often remain untouched.&lt;/p&gt;
&lt;p&gt;The gradient vanished before it reached them.&lt;/p&gt;
&lt;p&gt;This is why people can acknowledge being wrong about something specific while remaining fundamentally unchanged. The error signal ran out of strength before it reached the load-bearing beliefs.&lt;/p&gt;
&lt;h2&gt;Residual Connections&lt;/h2&gt;
&lt;p&gt;The fix in neural networks is almost philosophical: create shortcuts. Residual connections. Pathways that let the signal skip across depth, arriving at the beginning with enough force to matter.&lt;/p&gt;
&lt;p&gt;In a residual network, each block adds its contribution to its own input rather than replacing it entirely. The identity of the signal is preserved, carried forward through a bypass that maintains gradient flow. The deep layers still do their work, but the shallow layers can finally hear the feedback.&lt;/p&gt;
&lt;p&gt;This simple architectural choice—let the signal skip ahead—enabled the training of networks with hundreds of layers. The vanishing gradient didn&apos;t disappear, but it was routed around.&lt;/p&gt;
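&lt;p&gt;The arithmetic of the shortcut is easy to check. For a residual block y = x + f(x), the derivative is 1 + f&apos;(x): the identity term keeps the gradient alive no matter how small f&apos;(x) is. A toy comparison, with a hypothetical per-layer derivative standing in for a real network:&lt;/p&gt;

```python
# Toy comparison: a plain chain multiplies per-layer derivatives g,
# while a residual chain y = x + f(x) multiplies (1 + g) because of the
# identity shortcut. The value of g is hypothetical, for illustration.
g = 0.1      # assumed per-layer derivative of f
depth = 30

plain = g ** depth             # product of 30 small factors
residual = (1.0 + g) ** depth  # identity shortcut adds 1 to each factor

print(f"plain chain:    {plain:.1e}")
print(f"residual chain: {residual:.1f}")
```

&lt;p&gt;The plain chain&apos;s gradient is astronomically small; the residual chain&apos;s stays above one (here it even grows, something real networks typically temper with normalization). The identity path is what lets the signal reach the foundations.&lt;/p&gt;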
&lt;h2&gt;Building Residual Connections in Ourselves&lt;/h2&gt;
&lt;p&gt;The fix in ourselves is the same, and harder.&lt;/p&gt;
&lt;p&gt;It means building deliberate channels back to first principles. When you encounter disconfirming evidence, explicitly ask: &lt;em&gt;what assumption generated the prediction that just failed? And what assumption generated that assumption?&lt;/em&gt;&lt;/p&gt;
&lt;p&gt;It means treating settled questions as open questions. The longer you&apos;ve believed something, the more suspicious you should be of it. Certainty has a half-life; the beliefs you&apos;re most confident about are often the ones most in need of examination.&lt;/p&gt;
&lt;p&gt;It means accepting that the deepest error might live in the shallowest layer—in something you decided so long ago you no longer experience it as a decision. Your earliest experiences set parameters that still constrain everything built on top of them.&lt;/p&gt;
&lt;h2&gt;The Discipline of Remembering&lt;/h2&gt;
&lt;p&gt;The gradient wants to vanish. Depth wants to forget. This is the default, the path of least resistance.&lt;/p&gt;
&lt;p&gt;Remembering is architectural. You have to build structures that force the signal through, that maintain connection between surface updates and foundational assumptions. Journaling practices that revisit old beliefs. Conversations with people who knew you before. Deliberate exercises in reasoning from first principles rather than cached conclusions.&lt;/p&gt;
&lt;p&gt;It&apos;s uncomfortable because it&apos;s meant to be. If the signal reached the foundations easily, the foundations would be unstable. Some attenuation is protective. But too much attenuation means you&apos;re learning only at the surface while the depths remain frozen.&lt;/p&gt;
&lt;p&gt;The goal isn&apos;t to eliminate the vanishing gradient—it&apos;s to build enough residual connections that the truly important signals can still get through.&lt;/p&gt;
</content:encoded><author>Henrik Albihn</author></item></channel></rss>