Org-Builders: If Nothing Off The Shelf Works, Consider Just Building It.

By Bill Chen @ 2026-05-13T21:06 (+15)

Some actionable steps for operators and org-builders, as we need scale-up org infrastructure in 2026.

Skip to the advice for building with LLMs.

Why Hasn’t Anyone Got a Product for Me?

EA and AI Safety orgs have weird structures compared to most other orgs. We run on grant funding, not sales. We’re UK CIGs or US 501(c)(3)s, but we are highly productive, lean operations. We have an unusually junior-skewed talent pool but substantial trust and agency. We don’t sell a hot product, but we're often still fast-scaling orgs.

This often means that product solutions out there often don’t fit our shape. I experienced this firsthand at Raise while trying to find a way to free up local field builders from bureaucratic bookkeeping. There was no good product providing the systemised visibility we needed for chapters from UCL to Edinburgh, over the finances coming in and out.

After a few months of trialling different solutions, I decided to just build one. Raise, like many other EA and AI Safety orgs, relied heavily on a modern workflow in Google Workspace, Slack, Airtable, and, of course, Claude. Because of this, just building a solution is often more viable than it initially seems.

Building a multi-hundred-variable product has taught me a thing or two about building infrastructure with frontier LLMs. I think this information is exceptionally useful during a period of rapid scaling in the AI Safety ecosystem, given multiple sources of increased funding (1, 2 & 3). As we found out at Raise, products that work at a scale of 5 don’t work at a scale of 15.

This serves as a call to action and a guide to what I’ve refined as best practice to help generalists, operators, and infrastructure builders to best solve problems.

Table of Contents

  1. Just Building It at Raise
  2. When Just Building It May Make Sense
  3. Why Isn’t Just Building It More Common?
  4. Advice for Building with LLMs
  5. When Just Building It Doesn’t Make Sense
  6. Closing Thoughts

Just Building It at Raise

Raise currently has over 10 accounts, and the buck stops with me for every one. It's a weird structure, reflecting the disparate nature of our field-building for effective giving across the UK: volunteer and student-led, encouraging agency but making compliance difficult.

An (outdated) list of chapters we’ve had active across the UK.

 

The way Raise has historically managed finances meant that each local chapter had to do its own bookkeeping via a treasurer. Our sole bank account was held with the National team, which would approve reimbursement requests. We’ve never been able to show, say, the Edinburgh chapter, to see their transaction history and balance.

Earlier this year, I went looking for an off-the-shelf product. Initially, a full banking migration to a new bank that supports sub-accounts for chapters was chosen, but it proved too complicated for our needs. The only non-bank, somewhat viable option I found was an accountancy software product for churches.

It was at this stage that I realised that our weird shape also had weird advantages - a modern stack with high levels of Claude-connectivity. A few days later, a tailor-made product has been created, ready for trialling with our Cambridge-based chapter. One that meant a position ‘treasurer’ is mostly redundant now across all of our chapters.

Raise’s needs are weird by conventional standards, and a good product didn’t exist for us, given how small the market is for a solution tailored to Raise’s weirdness. But this weirdness of organisational shape and needs is common in EA/AI Safety spaces. We should be open to recognising when mainstream solutions don’t work and just building something, utilising frontier models to build beyond internal ops capabilities.

When Just Building It May Make Sense

As a good intuition: if the best available alternative is a highly menial process or to pay someone to do it, consider if it’s possible to build it in-house with Claude and a full stack of connectors first.

I think the best term to describe this sort of internal product-scale building is vibe building, reflecting the heavy LLM-based outputs compared to the already heavily in-house, custom-designed infrastructure many orgs have built by hand.

Why Isn’t Just Building It More Common?

EA/AI Safety orgs already utilise a highly modern workflow, and nearly every software stack I see is highly integrated with Claude connectors. The standard lot: Docs, Sheets, Drive, AirTable, Slack, Asana, Forms, Canva, Figma, Notion, and Obsidian are all able to be deeply integrated into Claude, for it to draw deep context and often edit or make new files.

There’s little that these products can’t do for a small or scaling team. Claude Co-Work has gotten increasingly good at planning, testing, and executing goals. I think this is underexploited due to limitations in information diffusion in ops-heavy work. Namely:

At the same time, the tooling to actually build something well is also relatively recent:

The earlier proliferation of vibe coding compared to ops-style vibe building demonstrates the novelty of being able to just build now. Claude Code, Cursor, and GitHub Copilot have given developers genuinely useful tools for over a year, as frontier models worked well on bounded, static code bases with obvious verifiability loops. The same sort of performance with agentic work for vibe building was historically hampered by the fact that success is harder to define, verification loops are non-obvious, and tasks span many tools. The shift towards models that can actually carry these workflows through to completion is recent, and benchmark trends across tool orchestration and long-horizon coherence broadly align with my intuition that Opus 4.6 and 4.7 have been the first to deliver high-quality products.

Advice for Building with LLMs

Before you start

While you’re building

Once a prototype comes

When Just Building It Doesn’t Make Sense

Vibe-building isn’t always the right answer. A few clear signals to push back against it:

Closing Thoughts

The productive uplift circa May 2026 is moving quickly towards more messy, infrastructure- and ops-oriented work. I think we need to take our friends in the software engineering team more seriously. I find now to be an exceptionally impactful time to be building - the absorption constraints are often not funding nor talent. This piece is an ode and a call to action to all of my generalists and operators to get building the capacity we need.

For those wanting to try out building infrastructure, there are an increasing number of new programmes to do so: Generator, GovAILISA, Coefficient Giving, BlueDot & MATS.

Now go and build something.

Catch me at EAG London 2026.

Thank you to Mick Zijdel for reviewing a draft & 3 additional pieces of building advice.

Thank you to @Tym 🔸 for reviewing a further draft. 

This is cross-posted from my blog.


Lorenzo Fong Ponce 🔸 @ 2026-05-13T23:09 (+4)

Ops culture isn’t yet a building culture - the default mindset is “keep the trains running” rather than “ship something”.

real

SummaryBot @ 2026-05-14T16:32 (+2)

Executive summary: The author argues that many EA/AI Safety orgs should consider building their own internal tools with LLMs when off-the-shelf products don’t fit their unusual structures and needs.

Key points:

  1. EA/AI Safety orgs often have atypical structures (grant-funded, lean, junior-skewed, fast-scaling) that make existing software products a poor fit.
  2. In the author’s case at Raise, no suitable financial tooling existed, so they built a custom system with LLMs that reduced the need for local treasurers.
  3. Building in-house is especially appropriate when the market is too small, processes change frequently, constraints block standard tools, or simple bespoke solutions outperform complex software.
  4. This approach is underused due to low visibility of internal tools, bespoke solutions being hard to share, and an ops culture focused more on maintenance than building.
  5. Recent advances in LLM capabilities (connectors, agent workflows, long-horizon execution) make “vibe building” across tools newly viable for non-engineers.
  6. The author advises structured, iterative LLM use (clear goals, verification loops, modular builds, documentation, testing) and cautions against building when systems are fragile, sensitive, unmaintainable, or when good external or outsourceable solutions exist.

 

 

This comment was auto-generated by the EA Forum Team. Feel free to point out issues with this summary by replying to the comment, and contact us if you have feedback.