Articles.

Long-form engineering essays for founders, operators and technical leads who need software systems that remain useful when real business pressure arrives.

Start with a diagnosis before adding more tooling.

Most fragile systems do not need another dashboard first. They need the failure points mapped, the hot paths simplified and the data model made trustworthy.

Book a system diagnostic

Towards an AI Runtime That Learns Through Work

Useful AI memory may not come from storing more context beside the model, but from capturing the path of bounded actions, traces, corrections and repeated outcomes through the runtime itself.

Published by Crescita12 May 2026AI architecture

What if memory was not something added beside the system, but something created by the system’s own path of action?

I keep coming back to the idea that AI memory may be framed in the wrong place. Most systems treat memory as something adjacent to the model: a vector store, a conversation history, a document index, a saved fact, a larger context window. Those things can be useful, but they are not quite the same as experience.

Experience is not just information that was stored. Experience is information that has been tested by action. A system tries something, follows a path, reaches a result, and that result is accepted, rejected, corrected, repeated, avoided, reinforced or made more precise. Over time, the useful parts of that path become more stable. The weak parts become visible. The repeated corrections become pressure for the structure to change.

This is the thesis I want to explore: memory should not live only in a storage layer beside the runtime. In a more useful AI system, memory should also live in the action chain itself. The path a request takes through the system, the nodes it touches, the contracts it satisfies, the uncertainty it exposes, and the traces it leaves behind become part of the system’s intelligence.

The filesystem as a logic tree

One possible way to think about this is through the filesystem. Not as a proposed standard, and not because folders are magical, but because a filesystem already gives us a visible topology. It has a root. It has branches. It has local boundaries. It has containment. It has a physical structure that can be inspected.

In this architecture, the filesystem tree becomes more than storage. It becomes a logic tree. The root is the main harness that controls the flow of state through the system. It owns routing, contracts, snapshots, telemetry, versioning, update gates and recovery paths. Below it, each folder becomes a bounded node responsible for a specific kind of transformation.

A node is not a general-purpose agent. It is a narrow functional unit. It receives a specific input state, transforms what it can meaningfully process, and returns a specific output state. The deeper a request travels through the tree, the more specialised the handling becomes. At the top, the runtime may only know the broad shape of the request. As the packet moves through branches, each layer resolves more of the state.

The intelligence is not located in one place. It is distributed across the path. That path, repeated over time, becomes practical memory.

A packet moving through bounded nodes

The unit of work in this architecture is a flat state packet. The packet does not need to be clever. It needs to be explicit. It carries the task, the current payload, the context available at that point, any evidence attached to the state, the current confidence, the current status, and the trace of what has happened so far.

Each node receives the packet through an explicit contract. It should not reach freely into distant folders or silently import logic from unrelated parts of the tree. Hidden coupling makes the apparent structure misleading. Communication should happen through the runtime, through an API or message-bus style contract. A node should know what it is allowed to receive, what it is allowed to emit, and what transformation it is responsible for.

That restriction is important because it makes experience traceable. If a packet passes through several nodes and the output fails, the system can inspect the path. It can see where confidence dropped, where the contract was stretched, where an oracle was called, where a fallback was used, where the packet changed shape, and where a later correction attached to the result. Without that structure, memory becomes vague. With that structure, memory becomes causal.

Experience as reverse propagation

There is a kind of backpropagation in this idea, but not in the strict neural-network sense. No mathematical gradient is being pushed through model weights after every request. The more useful comparison is reverse propagation of experience through the execution tree.

A packet travels forward through the runtime and reaches a result. That result produces evidence about the path that created it. If the result was strong, the path gains confidence. If the result failed, the weak part of the path becomes visible. If a correction was made, that correction can be attached to the node, contract, context, oracle call or transformation that likely caused the issue.

Over time, the system begins to build a history of its own behaviour. It learns which paths are reliable for which input states. It learns which nodes are too broad. It learns which constants are weak. It learns where deterministic code is missing. It learns where a small oracle is being asked too much. It learns where the shape of the contract is wrong.

This is the main memory mechanism. The memory is not simply “this fact was stored before”. The memory is “this kind of state moved through this path, failed at this point, was corrected in this way, and now the path has changed”. That is learned action.

Short memory, long memory and graph memory

There are still different kinds of memory in this system, but they serve different purposes. The short-term memory is local and practical. It lives in constants, recent traces, local context, rolling metrics and small adjustments around a node. It helps the node behave better without turning every request into a large prompt or a broad retrieval problem.

The longer-term memory comes from accumulated traces, accepted outputs, rejected outputs, corrections, tests, model decisions, contract changes and version history. This becomes the evidence base for future evolution. It is not just stored context; it is the historical record of what the runtime tried and what happened.

A semantic graph can support this, but it should not be confused with the core action memory. The graph is useful for context that is not directly required to fulfil the current transformation. It can store entities, documents, relationships, runtime logs, evidence, historical decisions, failure patterns and broader structural knowledge. The action chain is the core memory of the system. The graph is the wider context around that memory.

That distinction matters. If the graph becomes the main mechanism, the system risks drifting back into retrieval-heavy memory. If the action chain remains the main mechanism, the system learns through use. The graph then becomes a supporting structure that helps explain, enrich and reuse what the action chain has already discovered.

Why boundedness matters

For this kind of architecture to work, every layer needs limits. A node should not grow indefinitely. A file should not become an unbounded pile of special cases. A folder should not absorb every neighbouring responsibility. A prompt should not expand forever. A local oracle should not become a hidden general-purpose reasoning engine. A contract should not keep widening until it accepts almost anything.

The limits are not arbitrary. They create evolutionary pressure. A node can evolve inward for a while. Its deterministic logic can improve. Its constants can become sharper. Its local context can become better tuned. Its oracle question can become narrower. Its output contract can become more stable.

Eventually, the node may reach a point where adding more internal logic would make it worse. At that point, the right move is not to keep expanding the node. The system should evolve outward. A wrapper may be added. A child node may be created. A sibling node may handle a separate branch of the state. A preprocessing layer may prepare the packet before it reaches the completed node. A postprocessing layer may improve confidence after the node has returned its result.

A node becomes intelligent by becoming bounded enough to be reliable. Once it is reliable, further learning should not destroy that reliability. It should build around it.

Deterministic code before model growth

The order of evolution also matters. A failed output should not immediately imply a larger model, a longer prompt or specialist training. The first question should be whether the deterministic structure around the state is good enough.

Maybe the code is wrong. Maybe the packet contract is unclear. Maybe the output schema is too loose. Maybe the constants are weak. Maybe the node is too broad. Maybe the wrong branch handled the packet. Maybe the oracle was asked an overly broad question. Maybe the runtime does not yet have enough examples to justify model specialisation.

The conservative path is to improve code first, then contracts, then constants and local context, then oracle prompts and model selection. Specialist training should come late, after the system has accumulated enough high-quality operational history. Training without experience is mostly speculation. The runtime needs to earn its dataset through use.

The model should not be trained on vague ambition. It should be trained on the system’s own history of bounded transformations.

Oracles as narrow uncertainty resolvers

The oracle is still useful, but it should not own the whole task. Inside this architecture, an oracle is used when deterministic logic reaches a fuzzy state. The runtime should already know what it is uncertain about before the oracle is called.

That uncertainty should be shaped into a narrow question. The oracle should not be asked to analyse the entire system, rewrite the whole branch, or decide the full outcome of a broad request. It should be asked to resolve a specific ambiguity that the surrounding code can validate.

This allows smaller models to be useful earlier. A small generic oracle may not be capable of broad reasoning across a whole workflow, but it may be useful for a constrained local decision if the input is structured and the output options are limited. As the node evolves, the oracle path can change. The system may start with a small generic oracle, improve the surrounding context, move to a larger generic oracle only when justified, and eventually train a specialist model only when the trace history is rich enough.

The direction should be towards less uncertainty, not more dependence. If the same oracle decision appears repeatedly, that pattern should be promoted into deterministic logic, constants, tests, context adjustments, contract changes or training data.

Constants as local context

In this architecture, context should not be treated as a long conversation. The system should not rely on a large prompt remembering everything. Context should be local, explicit and structured. Constants belong beside the code that uses them. They should be flat where possible, easy to inspect, easy to diff and easy for both deterministic code and small models to consume.

This makes context part of the evolvable structure. A constant can change because a trace showed repeated failure. A threshold can shift because a node became too strict or too loose. A local dictionary can improve because a pattern appeared often enough. A prompt fragment can become smaller because deterministic code absorbed part of the decision.

The important point is that context is not a vague memory cloud. It is a controlled part of the node. Just like code, it has limits. If context grows too large, that is a signal. The node may be doing too much, the contract may be wrong, or the oracle may be compensating for missing structure. The system learns not only by adding context, but by discovering when context should be replaced by structure.

When experience becomes training data

Specialist model training is the late-stage form of this memory. By the time training becomes useful, the system should already have a rich history of input states, output states, traces, corrections, failures, accepted results, rejected results, oracle decisions, contracts, tests and version changes.

That history is much more valuable than a generic document set because it contains action. It shows not only what the system saw, but what it did, how it failed, how it was corrected, and what eventually became reliable.

A specialist model trained from that history would not simply be memorising facts. It would be absorbing a bounded transformation that has already been explored by the runtime. Even then, the model should not erase the harness. The harness still provides contracts, validation, traceability, thresholds and controlled evolution. The model may become a better local oracle. It may reduce the need for some surrounding logic. It may make the node more compact. But the action chain still matters because it is the source of future memory.

The model becomes a compressed expression of experience. The runtime remains the structure that creates new experience.

A self-healing system, carefully bounded

This leads to a careful version of self-healing. Not uncontrolled mutation. Not code silently rewriting itself. Not a runtime drifting away from its original shape.

A bounded self-healing system would detect pressure through traces. A repeated failure crosses a threshold. A node becomes too heavy. A contract produces too many ambiguous states. An oracle path becomes too frequent. A context file grows beyond its useful size. A branch becomes unreliable for a class of packets.

When that happens, the system should snapshot the current state, lock the affected branch, create a proposal, run tests, compare against historical traces, and only promote a change if it improves the bounded behaviour without damaging what already works. That is the difference between evolution and chaos.

The runtime can change, but it changes through evidence. The tree can grow, but it grows through pressure. A node can specialise, but only because the previous path showed why specialisation was needed. Every change remains attached to the history that caused it.

The semantic graph as secondary memory

The semantic graph still has a strong place in this architecture. It is useful for storing context that does not need to be directly present inside the action chain. It can preserve wider relationships between documents, entities, traces, failures, patches, contracts, tests, outputs and model versions. It can help the runtime find similar past states, recover supporting evidence, discover repeated patterns and prepare future training sets.

But the graph should support the action chain, not replace it. The main intelligence comes from the system repeatedly transforming state through bounded paths. The graph makes those transformations easier to understand and reuse. It provides the wider map, but the actual learning pressure comes from the packet moving through the tree.

That separation keeps the architecture grounded. The graph stores wider memory. The tree performs learned action. Together, they form a system where context and execution can reinforce each other without collapsing into one unstructured memory layer.

A quieter AI OS thesis

This is not a manifesto for a finished architecture. It is a thesis about where memory might belong.

Instead of treating memory as a separate database beside an AI model, the system can treat memory as something created by action. A request moves through a tree. Each layer transforms the state. Each transformation leaves a trace. Each trace becomes evidence. Each correction becomes pressure. Each repeated success becomes a stable path. Each stable path becomes part of the system’s intelligence.

The filesystem tree is only one possible shape for this. Its value is that it makes boundaries visible. It encourages local responsibility. It makes growth inspectable. It turns architecture into something that can be constrained, measured and evolved.

The deeper idea is that intelligence does not have to live only inside model weights. Some of it can live in the path. Some of it can live in the contracts. Some of it can live in constants. Some of it can live in tests. Some of it can live in traces. Some of it can eventually be compressed into specialist models.

A useful AI runtime may therefore be less like a chatbot with memory attached, and more like a structured system that remembers by doing: a logic tree that learns through trial and error, a harness that turns repeated action into process memory, and a bounded runtime where every successful path, failed path and corrected path becomes part of the intelligence of the system.

Back to articles

Self Learning Software Should Grow Through Use

Why useful AI systems should improve through repeated work, turning pressure, correction and operating memory into clearer structure rather than larger prompts.

Published by Crescita13 May 2026AI architecture

The previous article explored the idea that AI memory should not sit outside the work.

Memory becomes far more useful when it is created by the work itself, through the path a request takes, the decisions made along the way, and the corrections that follow.

The next step is learning.

If a system can remember what happened during work, it should eventually be able to improve from that history. Not by becoming an uncontrolled agent. Not by rewriting itself freely. Not by treating every failure as a reason to use a bigger model.

The more useful idea is quieter than that.

Software should grow through use.

A system that handles the same kind of work repeatedly should become better at that work over time. It should notice where effort is wasted. It should recognise where people keep stepping in. It should see where the same uncertainty appears again and again. It should know which parts of its process are reliable and which parts are fragile.

Most software does not do this. It performs the work, but the learning happens somewhere else. A person fixes the output. A ticket is created. A note is written. A meeting happens. A patch is made. The system may be improved, but it does not really understand why it needed to improve.

That is the missing loop.

Self learning software should close that loop.

Learning should come from pressure

A useful system does not need to change constantly. In fact, constant change is a bad sign. Stable things should stay stable.

The better signal is pressure.

Pressure appears when the same part of a process keeps needing attention. A workflow keeps failing in the same place. A decision keeps needing review. A report keeps needing correction. A document keeps being misread. A rule keeps being misunderstood. A model keeps receiving the same kind of rescue prompt. A piece of logic keeps stretching beyond what it was built to handle.

That pressure is not noise. It is the system telling us where learning is needed.

The mistake would be to treat every pressure point as a model problem. Sometimes the model is not the issue. The process may be unclear. The boundaries may be wrong. The input may be too loose. The output may not be specific enough. The system may be asking for judgement where a rule should exist. Or it may be using rules where judgement is genuinely required.

A self learning system should not rush toward intelligence. It should first understand the shape of the pressure.

That is the difference between a system that merely reacts and a system that matures.

The system should become more structured, not more chaotic

When people talk about self learning software, it often sounds like software becoming more fluid. The system adapts, mutates, rewrites, expands and improvises.

That is not the version worth building.

The better direction is the opposite. A system should become more structured as it learns.

If something is repeatedly corrected, that correction should become part of the process. If a decision becomes predictable, it should become a rule. If a pattern keeps appearing, it should become a recognised shape. If a judgement keeps requiring the same context, that context should move closer to the work. If a task is too broad, it should split into smaller responsibilities.

Learning should reduce confusion.

The system should not grow by adding endless context to every request. It should grow by turning repeated experience into clearer structure.

This matters because the real danger with AI software is not that it fails once. Normal software fails too. The danger is that it keeps failing in ways that remain hard to inspect. More prompts, more memory, more retrieved context and larger models can hide that problem for a while, but they do not necessarily make the system more understandable.

A self learning system should become easier to reason about over time.

If it becomes harder to understand every time it learns, it is not maturing. It is accumulating mess.

Models should absorb proven experience, not guess the process

There is a place for specialist models in this architecture, but they should come later.

A model should not be trained too early, before the system knows what it is really trying to learn. Early training is often based on assumptions. The team imagines the task, prepares examples, writes expected outputs and hopes the model learns the right behaviour.

That can work, but it misses something important.

The strongest training material comes from work that has actually happened.

Real work contains the awkward cases. The partial failures. The corrections. The edge cases. The moments where the process was unclear. The points where people disagreed. The examples that looked easy but were not. The repeated patterns that only became obvious after the system had been used.

That history is valuable because it was earned.

A specialist model trained on that history is not being asked to invent the process from scratch. It is being asked to absorb a process the system has already explored. The model becomes a compressed form of experience, not a replacement for experience.

That is a much healthier role for AI.

The model should not be the first answer to uncertainty. It should be what the system earns after enough uncertainty has been observed, corrected and shaped into a learnable pattern.

Memory should become capability

There is a difference between remembering something and becoming better because of it.

A system can store huge amounts of history and still not improve. It can retrieve previous examples and still make the same mistakes. It can keep logs forever and still not know what they mean.

Useful memory has to change future behaviour.

If a report was corrected ten times, the eleventh report should benefit. If a workflow failed repeatedly, future workflows should avoid the same path. If a particular kind of input creates uncertainty, the system should recognise it sooner. If a person keeps fixing the same output, that correction should eventually become part of the system’s operating knowledge.

That is the movement from memory to capability.

The system is not just keeping a record. It is turning experience into better work.

This is also where the previous article connects directly to the self learning proposition. If memory is created by the action chain, then learning is the process of using that memory to improve the action chain.

Work creates memory.

Memory exposes pressure.

Pressure guides improvement.

Improvement becomes capability.

That is the loop.

The maintainer remains the judge

Self learning does not mean self approving.

A serious system should be able to propose change, but not silently promote every change into production. The maintainer still matters because the maintainer owns the standard.

The system can show where pressure is building. It can show what keeps failing. It can suggest where structure should change. It can test whether a proposed improvement would have helped past cases. It can compare old behaviour against new behaviour.

But the final decision should remain reviewable.

That is what makes the idea safe enough to take seriously. The system is not trusted because it is autonomous. It is trusted because it can explain why change is being proposed.

A good self learning system should be able to show where the process keeps failing, which examples prove the pattern, what the smallest useful change might be, what could break, and how to roll back.

That is not magic. That is disciplined learning.

Why this matters for companies

Companies already teach their systems every day, but mostly in ways the systems cannot absorb.

Every manual correction is a lesson. Every repeated support issue is a lesson. Every delayed report is a lesson. Every exception process is a lesson. Every internal workaround is a lesson. Every judgement call that happens outside the software is a lesson.

The problem is that these lessons usually remain outside the system.

They become messages, meetings, documents, habits and informal knowledge. The company learns, but the software stays mostly static.

That is a strange mismatch. The software is often where the work happens, but not where the learning accumulates.

A self learning runtime would change that. It would let operational knowledge collect around the work itself. It would make repeated friction visible. It would help stable processes become more stable and unstable processes become candidates for improvement.

The value is not simply automation. The value is compounding operational memory.

A company using this kind of system would not just execute more quickly. It would become better at understanding how its own work actually happens.

The self learning proposition

The practical future of AI in software may not be one giant model sitting above every process.

It may be systems that learn from the work passing through them.

They do not need to remember everything. They need to remember what changes future execution. They do not need to mutate constantly. They need to evolve when pressure proves that evolution is justified. They do not need to replace maintainers. They need to give maintainers better evidence.

The important shift is from passive software to experienced software.

Passive software runs a process.

Experienced software carries the history of that process forward.

It knows where it has succeeded. It knows where it has failed. It knows where people keep correcting it. It knows which parts have become reliable and which parts still need help.

That is the version of self learning software worth building.

Not software that becomes more mysterious over time.

Software that becomes clearer, more reliable and more capable because it has done the work before.

Back to articles

Attention Is All You Need. Context Is All You Get.

Attention may power the model, but structured context determines whether an AI system becomes useful, distracted or confidently wrong.

Published by Crescita1 May 2026Context engineering

Attention may be all the model needs, but context is what decides whether it becomes useful, distracted, or confidently wrong.

In 2017, a paper called Attention Is All You Need helped kick the door open for the modern Transformer era. The title was elegant, slightly smug, and unfortunately memorable enough to become the “Live, Laugh, Love” wall decal of machine learning.

The paper’s point was technical: Transformer models could use attention mechanisms instead of recurrence or convolution for sequence modelling. It was not saying, “throw forty-seven random files into a prompt and let the silicon oracle sort it out.”

But here we are.

We now have models that can read codebases, write software, inspect documents, run tools, call APIs, and explain themselves with the calm confidence of a senior engineer who has just deleted the wrong database. The models are impressive. The tooling is impressive. The context windows are bigger. The demos are shinier.

And yet the same problem keeps appearing: the model is not stupid. The context is.

The junk drawer problem

Most AI failures in business software do not happen because the model cannot reason. They happen because the model is handed a badly packed suitcase and told to produce a clean itinerary.

A common workflow looks something like this: “Here are the project files. Fix the issue.”

Then the agent receives source files, logs, configs, documentation, stale notes, unrelated modules, migration leftovers, half-written TODOs, old architectural decisions, and possibly a README last updated during the reign of jQuery.

Technically, the model has context. Practically, it has been mugged by information.

That is not context engineering. That is rummaging.

This is where hallucination and scope creep start creeping in. Not because the model is malicious. Not because attention failed. But because relevance has been treated as a boolean switch: either the model has access to the files or it does not.

Context is not more stuff

A lot of modern AI coding and document agents follow a similar shape: gather context, reason over the context, take an action, check the result, and repeat. That is the right loop. But the phrase “gather context” hides the most important question: gather which context?

For a small bug, the right context might be three functions, one failing test, and a recent error trace. For a compliance report, it might be a chain of document references, issue dates, approvals, contradictions, and missing signatures. For a codebase, it might be architectural intent rather than the literal content of every file.

Useful context is rarely “everything nearby.” It is the minimum structured evidence needed to reason without making things up.

Bigger context windows do not remove the problem. They simply make the junk drawer deeper.

Memory should not be a scrapbook

One half of the answer is memory. But not memory in the cosy product-marketing sense. Not a glorified notes folder. Not a pile of previous chats and documents stuffed into a vector database with crossed fingers and optimistic vibes.

Useful memory should behave more like a graph.

A plain memory system says: “This thing was mentioned before.” A useful memory system says: “This thing matters when the task involves that thing, conflicts with this other thing, and should be weighted differently depending on the current objective.”

That distinction matters.

For AI systems, memory is only useful if it can answer operational questions: what matters here, what has changed, what is still uncertain, what has already been decided, what should not be re-litigated, and what looks similar but is actually unrelated.

Without that structure, memory becomes nostalgia. The model remembers everything with equal emotional significance, like a drunk uncle at a wedding.

A strong memory layer should treat previous information as a living network of meaning, not a shoebox of old prompts.

The missing layer: a context compiler

Memory is only one half of the problem. The other half is extraction.

This is where the idea of a context compiler becomes important. The compiler’s job is not to ask the model nicely. Its job is to take messy input and turn it into structured, inspectable, task-relevant context before the model gets involved.

That input might be PDFs, CSV files, SQL dumps, JSON exports, XML documents, plain text, emails, project folders, contracts, reports and logs.

The goal is not to dump all of that into a context window. The goal is to compile it.

A compiler does not treat source code as a bag of words. It lexes, parses, resolves references, builds intermediate representations, checks meaning, and only then produces output. The same principle should apply to business context.

Raw files are not context.
Parsed facts are closer.
Linked facts are better.
Validated, task-specific facts are where the real value begins.

A context compiler is the part of the system that says: “Before we ask the model to think, let’s make sure it is thinking about the right thing.”

That is the difference between AI as a parlour trick and AI as infrastructure.

Attention still needs an adult in the room

The funny thing about Attention Is All You Need is that the title is both brilliant and dangerously easy to misuse. Attention is an internal mechanism. It helps the model weigh relationships between tokens.

It does not magically decide whether your uploaded folder contains the right evidence, whether your logs are stale, whether your PDF is superseded, or whether the file called final_final_v3_REAL.pdf is actually final.

That is outside the model. That is system design.

This is the part many AI demos quietly skip over. The model is not the product. The harness is the product. The context pipeline is the product. The memory layer is the product. The verification loop is the product.

The model is the reasoning engine inside a much larger machine.

The real bottleneck is not intelligence

Businesses often ask: “Can AI understand our documents?” Usually, yes. The better question is: “Can the system extract the right context from those documents, preserve it, verify it, and present it to the model without drowning it in irrelevant noise?”

That is harder. It is also where the money is.

Because once you can extract the right context, everything downstream improves. The model answers more accurately. Reports become more reliable. Agents take fewer weird detours. Outputs become easier to audit. Hallucinations become easier to catch. Scope creep becomes easier to constrain.

And the system starts behaving less like a chatbot with a filing cabinet and more like an operational layer.

The punchline

Attention may be all you need to build the model. But it is not all you need to build useful AI software.

Useful AI needs memory. It needs structure. It needs context selection. It needs compaction. It needs verification. It needs a compiler between the chaos of real-world data and the neat little rectangle where the model pretends everything is fine.

So yes, attention is all you need. Assuming someone has already done the hard work of making sure the model is paying attention to the right thing.

That someone is not the model. That someone is the architecture.

Need this kind of thinking applied to your own system?

Crescita helps businesses turn brittle workflows, slow APIs, messy documents and unclear technical architecture into dependable operational systems.

Talk to Crescita Back to articles

The Principle of Least Action for API Design

Fast, reliable APIs are rarely the result of clever tricks. They come from removing non-essential work from the request path, shaping data before it is needed, and proving behaviour under load.

Published by Crescita29 April 2026Backend systems

The fastest API is not the one with the most exotic infrastructure behind it. It is the one that knows exactly what work belongs in the request path, and has the discipline to keep everything else out of the way.

A lot of API performance work starts in the wrong place. The endpoint is slow, so the first instinct is to reach for more workers, more dynos, a larger database instance, a more aggressive cache, or a queue bolted onto the side. Those may all have their place, but they do not fix the underlying shape of the system. They only give a poor shape more room to breathe.

The better starting point is the hot path: the exact sequence of work that happens every time a request comes in. That path is where latency compounds, where database pressure collects, where object allocation quietly grows, and where a harmless-looking branch turns into an expensive multiplier once traffic arrives.

In physics, the principle of least action describes systems following a path that minimises a quantity called action. In software, the metaphor is useful even if the mathematics is different. A request path should perform the smallest amount of synchronous work required to produce a correct and useful response. Not the smallest amount of work overall. The smallest amount of work that must happen right now.

The API hot path is not the place to discover business meaning from scratch. It is the place to serve meaning that has already been prepared.

Most overloaded endpoints become overloaded gradually. They begin as a clean route: validate the request, check permission, fetch data, return a response. Then a reporting requirement appears. Then a score needs to be calculated. Then the response needs enrichment from another table. Then a third-party lookup is added. Then extra analytics are written. Each addition is reasonable in isolation. Together, they convert the endpoint from a serving layer into a live reconstruction engine.

That distinction matters. Serving is cheap when the representation is already close to the shape the caller needs. Reconstruction is expensive because the system has to rediscover relationships, aggregate rows, interpret rules, calculate derived values and format the result while the user waits.

A useful review of an API usually starts with five blunt questions:

What does the caller actually need before the response can be returned?
Which calculations can be performed before the request arrives?
Which database reads repeat across requests and should be cached, materialised or indexed differently?
Which side effects can be moved into a queue, log stream or batch process?
Which objects, joins or serialisation steps exist only because the internal model is leaking into the public response?

The answer is often not a single optimisation but a separation of responsibilities. Ingestion should collect raw truth. Computation should turn that truth into derived structures. Storage should preserve the structures needed by the product. Serving should read from those structures with as little live interpretation as possible.

This is why denormalised read models, materialised views, compact lookup tables, spatial indexes, search indexes and precomputed scores are not premature optimisation when the workload is known. They are ways of respecting the difference between writing truth and serving answers.

The same principle applies below the database layer. Single-thread performance still matters because every concurrent system is built out of serial sections. If the inner loop is wasteful, concurrency multiplies the waste. If every request allocates too many objects, traverses data in a cache-hostile layout, repeats parsing, or walks through abstraction layers that add no value to the hot path, more workers merely make the machine burn harder.

Before parallelising, it is worth finding the shape of the serial work. Profile the endpoint. Count queries. Inspect serialisation. Measure allocation. Look at the loops that run for every request. Check whether data is being copied, converted or re-sorted because two parts of the system disagree about representation. The boring measurements often reveal the real fault line.

Concurrency becomes useful once the core path is clear. It is not just about starting more workers. It is about splitting work without losing determinism, correctness or observability. A parallel process that produces different answers depending on timing is not an optimisation; it is a liability with better CPU utilisation.

For batch-heavy systems, deterministic concurrency usually means a parent process owns ordering, identity allocation and final mutation, while workers handle isolated units of work. Results can then be applied in a stable sequence. That design is less glamorous than a free-for-all worker pool, but it is much easier to test, replay and trust.

Load testing should then be used to verify the shape of the system, not to produce a heroic number for a slide deck. A useful load test answers practical questions: which endpoint bends first, whether the bend is caused by the application, database, network, queue or infrastructure, and whether the failure mode is graceful enough for production.

This also means separating background noise from real failure. Production systems receive bad requests, bot traffic, malformed payloads, repeated retries and upstream nonsense. Those events matter, but they should not be confused with the behaviour of valid business traffic. A clean analysis separates request classes, payload types, response codes, latency bands and time windows before making an infrastructure decision.

The practical pattern is simple: make the hot path lean, make the data shape honest, make concurrency deterministic, and make load tests evidence-led. Do that before adding complexity. Most businesses do not need a more theatrical backend. They need fewer surprises in the path that runs every single time.

Need this kind of thinking applied to your own system?

Crescita helps businesses turn brittle workflows, slow APIs, messy documents and unclear technical architecture into dependable operational systems.

Talk to Crescita Back to articles

AI Works Only as Well as the System Beneath It

AI is not a magic layer you can place on top of broken operations. It needs structured context, stable tools, reliable permissions and a system it can safely act through.

Published by Crescita29 April 2026Operational AI

The uncomfortable truth about business AI is that the model is rarely the first problem. The system around the model is.

A company can buy access to an excellent model and still get poor results if the model has no reliable way to understand the business, inspect the right data, respect permissions, or take action through controlled tools. In that situation, the chatbot becomes a polite guessing machine attached to a messy operating environment.

This is why many businesses do not need an AI chatbot yet. They need the foundations that would make an AI assistant useful in the first place: clean data ownership, predictable workflows, explicit permissions, stable APIs, searchable documents, and a clear model of what the system is allowed to know and do.

AI does not remove the need for structure. It increases the value of structure.

A useful way to think about AI is as an interface layer over operational capability. The model can interpret intent, summarise context, reason over options and generate plans. But the business still needs real tools underneath: systems that read from the source of truth, write to the correct place, record what changed, and fail safely when the request is ambiguous.

If those tools do not exist, the AI has to improvise. It reads stale exports, copied spreadsheets, incomplete documents and fragments of conversation. It may sound confident, but confidence is not the same as operational truth.

A practical AI system therefore needs three layers working together:

A context layer that can retrieve the right documents, records, code, logs or business entities.
A tool layer that exposes safe actions with explicit inputs, outputs and permissions.
A control layer that decides when the model may act, when it must ask, and when it should refuse.

This same problem appears in software development. A large codebase is not just text. It is a living system of dependencies, conventions, domain rules, migrations, tests, exceptions and scars. If an AI coding tool only sees a few files at a time, it may produce code that looks correct locally but violates the architecture globally.

Large repositories need compression, but not the lossy kind that turns everything into vague summaries. They need structured compression: symbol indexes, dependency graphs, module maps, API boundaries, data-flow notes, test relationships, architectural decisions and change history. The goal is not to stuff the whole repository into a prompt. The goal is to give the model the right slice of the system at the right moment.

That is closer to a compiler than a search box. A compiler does not treat source files as random text; it tokenises, parses, resolves symbols, checks types and builds intermediate representations. A codebase intelligence layer should do something similar at a practical level. It should know that a function is called from a task, that a model field affects an API response, that a migration changes a reporting assumption, and that a test fixture encodes a business edge case.

The same principle applies outside code. Operational documents, contracts, job sheets, compliance records and customer histories all need to be converted from passive content into usable context. That does not mean turning every document into a vector and hoping the answer appears. It means extracting entities, dates, obligations, status, ownership and relationships in a way that downstream tools can verify.

Once the system has structured context, AI becomes much more useful. It can answer questions against the current state of the business. It can draft work from reliable inputs. It can explain why a decision was suggested. It can route tasks to the right workflow. It can identify missing data instead of pretending the data exists.

Permissions are just as important. A human employee carries context about what they are allowed to see, change and approve. An AI assistant needs that boundary made explicit. Which documents can it read? Which records can it modify? Which actions require confirmation? Which operations are read-only? Which outputs are suggestions rather than decisions?

Without those boundaries, AI becomes either too dangerous to trust or too restricted to help. The useful middle ground is tool-mediated action: the model can propose or execute only through narrow interfaces that validate inputs, log decisions, and preserve accountability.

The strongest AI systems are not built by sprinkling prompts across a business. They are built by turning messy operational reality into a machine-usable substrate. That means doing the unglamorous engineering first: data modelling, API cleanup, document indexing, workflow mapping, audit trails and permission design.

The result is less flashy than a generic chatbot demo, but far more valuable. A good operational AI system should feel like an intelligent layer over a business that already knows where its truth lives. The model should not be forced to become the system of record. It should be a disciplined interface to systems of record that already work.

Need this kind of thinking applied to your own system?

Crescita helps businesses turn brittle workflows, slow APIs, messy documents and unclear technical architecture into dependable operational systems.

Talk to Crescita Back to articles

Graph Thinking for Business Systems and Spatial Data

Road networks, spatial scoring systems and business workflows all expose the same structural question: which connections matter, which details can be compressed, and what must never be broken.

Published by Crescita29 April 2026Systems thinking

A business system is often closer to a road network than a spreadsheet. It has junctions, bottlenecks, shortcuts, dead ends, high-value routes, local noise and hidden dependencies.

That observation becomes useful when designing software. Many operational problems are not primarily form problems or dashboard problems. They are topology problems. The system does not fail because a screen is ugly. It fails because information moves through the wrong path, approvals get trapped at the wrong junction, records branch without reconciliation, or important connections are invisible until something breaks.

Graph thinking gives us a language for that. A graph is made of nodes and edges: things and the relationships between them. In a road network, nodes might be intersections and edges might be road segments. In a business, nodes might be customers, jobs, invoices, assets, documents, employees, approvals or locations. Edges are the relationships: owns, depends on, blocks, approved by, supersedes, located near, derived from, assigned to.

The practical question is not whether everything can be represented as a graph. The practical question is whether the graph view reveals structure that the current system hides.

Good systems preserve the connections that carry meaning and compress the details that do not.

Spatial systems make this very obvious. If a scoring API needs to evaluate a location against nearby amenities, transport links, boundaries or risk zones, the naive approach is to rediscover geography for every request. Query the points, measure the distances, classify the categories, calculate the weights, aggregate the result, return the score. It works until volume, density or complexity arrives.

The better pattern is prepared geography. Expensive spatial meaning is calculated before the request path. Areas can be indexed into cells. Influence zones can be precomputed. Overlapping regions can be classified. Dense clusters can be summarised. The live request then becomes a lookup and composition problem rather than a full geographic investigation.

That shift is not only about speed. It changes the reliability of the system. When spatial meaning is prepared in a controlled pipeline, it can be inspected, versioned and validated. When it is recreated live, every request becomes a miniature experiment.

Road networks provide another useful lesson: simplification is dangerous if it breaks topology. You can remove visual detail from a path without changing the route it represents, but you cannot casually delete a junction, reverse an edge, disconnect a neighbourhood or remove an access point and expect the system to behave the same.

The same mistake happens in business process automation. A team sees duplicated steps, manual checks or local variations and decides to “simplify” the workflow. Sometimes that is right. Sometimes those apparently messy branches are where risk, compliance, domain knowledge or customer-specific handling lives. Remove them blindly and the system becomes cleaner on paper but less truthful in production.

A graph-oriented review asks different questions:

Which nodes are structural anchors that must be preserved?
Which edges carry essential business meaning?
Which local details can be compressed without changing outcomes?
Where are the bottlenecks, isolated islands and accidental single points of failure?
Which relationships are implicit today but should be represented directly?

Large graph workloads also force discipline around representation. Object-heavy structures are convenient early on, but they become expensive when the graph grows. Every Python object, tuple, dictionary entry or repeated reference carries overhead. At small scale, that overhead is invisible. At large scale, it becomes the system.

The solution is often less about inventing a clever algorithm and more about changing the internal representation. Dense integer IDs, compact columns, memory-mapped arrays, compressed adjacency lists, bitmaps for removal state, and append-only stores for synthetic edges can turn an impossible in-memory workload into a bounded, inspectable pipeline.

That design also supports safer mutation. Instead of constantly rebuilding the graph, a system can mark removed edges with tombstones, append new synthetic structures separately, and materialise the final output deterministically. This matters because large optimisation jobs are difficult to debug if every pass rewrites the world in place.

There is a general business lesson here. Many systems become fragile because they mix raw facts, derived meaning, temporary state and final outputs in the same structure. Once those layers are separated, the system becomes easier to reason about. Raw records stay raw. Derived structures are rebuilt from clear rules. Serving layers read from prepared views. Debug output is bounded and intentional.

Graph thinking does not mean every company needs a graph database. Sometimes a relational schema with the right join tables is enough. Sometimes a search index, spatial index, materialised view or simple adjacency table is the right tool. The important point is conceptual: treat relationships as first-class engineering objects.

When a workflow, spatial model or operational platform is designed this way, compression becomes safer. You can reduce noise while preserving connectivity. You can speed up queries without flattening meaning. You can build dashboards that reflect real dependencies rather than just counts. You can scale the system without losing the shape that made it useful.

The strongest systems know their topology. They know where things connect, where pressure accumulates, where shortcuts are safe, and where a tiny edge carries a large amount of meaning. That is as true for geography as it is for operations.

Need this kind of thinking applied to your own system?

Crescita helps businesses turn brittle workflows, slow APIs, messy documents and unclear technical architecture into dependable operational systems.

Talk to Crescita Back to articles

Document-Led Operations Need Compilers, Not Dashboards

Spreadsheets, PDFs, drawings, folders and exported reports are not the enemy. The problem starts when passive documents quietly become the operating system of the business.

Published by Crescita29 April 2026Operations

Spreadsheets are not the problem. PDFs are not the problem. Drawings are not the problem. The problem starts when passive documents become the place where operational truth lives, mutates and hides.

Most businesses accumulate document-led operations because documents are flexible. A spreadsheet can become a tracker in an afternoon. A folder structure can become a filing system without a procurement process. A PDF can be emailed, marked up, approved and archived. A drawing pack can carry months of project history. None of that is wrong. The trouble begins when the business starts relying on documents to behave like software.

A spreadsheet is a useful working surface. It is a poor database once multiple people depend on it for truth. A PDF is a useful delivery format. It is a poor workflow engine. A folder is a useful container. It is a poor audit model. A dashboard is a useful view. It is a poor substitute for understanding what the underlying documents actually mean.

The goal is not to replace every document. The goal is to stop pretending documents are structured systems when no structure has been extracted.

This is especially visible in document-heavy sectors. Project teams often operate through drawings, schedules, approvals, revisions, trackers, emails, sign-offs and compliance records. The state of the work exists, but it is scattered across file names, title blocks, revision notes, issue sheets, approval comments and informal conventions.

A generic dashboard cannot solve that. It can display charts after someone else has manually interpreted the documents, but it does not understand the project. The real work is document intelligence: extracting structured facts, validating them, linking them across files, and compiling them into a model the business can query.

That is why the compiler metaphor is useful. A compiler does not merely display source code. It reads text, tokenises it, parses structure, resolves references, checks meaning and emits a usable output. Document-led operations need the same kind of pipeline at a business level.

A practical document compiler would do several things:

Read files and extract structured entities such as dates, revisions, parties, assets, locations, obligations and statuses.
Resolve references between documents, drawings, schedules, approvals and trackers.
Detect contradictions, missing approvals, outdated revisions and ambiguous ownership.
Create a project-level representation that can support reporting, alerts, audit trails and decision-making.
Preserve traceability back to the original file so users can verify the source.

The traceability point is important. Document intelligence should not become another black box. If the system claims a revision is outdated, a package is blocked, or a compliance step is missing, the user should be able to see why. Which document produced the claim? Which line, title block, metadata field or approval record supports it? Which later document superseded it?

This is where many automation projects go wrong. They jump straight from messy documents to polished reporting. The middle layer is missing. Without a structured intermediate representation, the dashboard is only a prettier version of the old confusion.

Spreadsheets deserve the same treatment. Many spreadsheet-led operations are really unacknowledged software systems. They contain data models, formulas, workflow states, approval logic, reporting assumptions and business rules. The problem is that all of this is embedded in cells, colours, tabs, comments and habits rather than explicit structure.

The sensible path is not always to ban spreadsheets. Sometimes the right move is to extract the stable parts into proper systems while preserving spreadsheets as flexible interfaces where they still make sense. Put the source of truth somewhere reliable. Keep ad hoc analysis lightweight. Do not let the workbook become the only place where the business knows what is happening.

Browser-first tooling can help here. Not every serious operational tool needs a heavy backend. Some workloads are well suited to local files, browser-side processing and WebAssembly. A user can open a folder, parse documents locally, generate structured outputs and export results without uploading sensitive project material to a remote service.

That model is particularly attractive when the workload is document parsing, validation, transformation or visual inspection. The browser becomes a safe execution environment. WebAssembly provides performance for the heavy parts. Workers keep the interface responsive. The backend is reserved for things that truly require central coordination, model inference, account management or shared storage.

Local-first does not mean primitive. It means the default ownership model is different. The user keeps the files. The tool reads and compiles them. The outputs are portable. Collaboration can be added deliberately rather than assumed from day one.

The deeper principle is this: document-led operations need a semantic layer. They need a way to move from files to facts, from facts to relationships, and from relationships to decisions. A dashboard should be the final surface, not the primary intelligence.

Once that layer exists, the business can ask better questions. Which documents are stale? Which approvals are missing? Which packages are blocked? Which records contradict each other? Which manual tracker is duplicating another source? Which process step is creating most of the delay?

That is the difference between digitising documents and operationalising them. Digitising makes the file easier to store. Operationalising makes the meaning easier to use.

Need this kind of thinking applied to your own system?

Crescita helps businesses turn brittle workflows, slow APIs, messy documents and unclear technical architecture into dependable operational systems.

Talk to Crescita Back to articles

Defensive Security Needs Reproducible Harnesses

Defensive analysis is most useful when it turns unknown behaviour into controlled reproduction: isolated environments, deterministic cases, clear traces and evidence engineers can act on.

Published by Crescita29 April 2026Security engineering

Defensive security work becomes useful when it moves from “that looks suspicious” to “we can reproduce the behaviour, isolate it, trace it and explain it”.

Security analysis often starts with fragments: a strange domain, a suspicious form, an unfamiliar WebSocket flow, a minified script, a payload that changes shape, or a browser interaction that appears to capture more than it should. The temptation is to poke at the live system manually and see what happens. That may satisfy curiosity, but it rarely produces durable engineering evidence.

A better pattern is to build a local harness. The goal of a harness is not to attack anything. It is to model observed behaviour in a controlled environment so the analyst can understand the protocol, record the state transitions, and identify the boundary between expected and unsafe behaviour.

Good defensive analysis is controlled reproduction, not vibes.

A useful harness starts by separating observation from interaction. Observation collects what the browser receives and sends: scripts, network calls, headers, payload formats, event names, timing, redirects and storage changes. Interaction should be limited, deliberate and safe. You want evidence, not collateral traffic.

For modern web applications, that often means modelling a frontend-to-backend message bus. A page may use WebSockets or a socket-style protocol to send login events, heartbeat events, acknowledgements, form updates and result messages. If the analyst can reproduce those messages locally, the behaviour becomes inspectable instead of mysterious.

The important word is reproduce. A one-off manual test is weak evidence. A case file that can be run repeatedly is much stronger. It allows another engineer to verify the finding, adjust assumptions, compare responses and confirm whether a change fixed the issue.

A practical defensive harness should usually capture:

The initial connection state, including origin, token format and negotiated protocol behaviour.
The exact messages sent by the client and received from the server.
The order of events, acknowledgements and heartbeats.
Any encoding, encryption, compression or transformation applied by the frontend.
A deterministic set of test cases with expected outcomes and clear failure output.

The harness should also make unsafe actions difficult. It should default to local targets, synthetic payloads and non-sensitive data. If live observation is required, it should be read-only wherever possible and tightly scoped. Defensive tooling should not create a second problem while investigating the first.

This is where engineering discipline matters. Security notes written as prose are easy to misread. Screenshots are easy to lose. Console logs without context are hard to replay. A harness gives the team an executable description of the behaviour.

That executable description becomes valuable beyond the initial investigation. It can be used to test a patch, monitor regression, document the protocol, train other engineers and distinguish actual vulnerabilities from noise. It also helps non-security stakeholders understand the issue because the evidence is structured rather than theatrical.

There is a broader lesson for software teams. Any workflow that matters should be reproducible. If a system fails only when a precise sequence of frontend events occurs, the team should have a way to replay that sequence. If a bug depends on timing, token state, message ordering or backend acknowledgement, the harness should make those conditions visible.

The best defensive security work is often calm and boring. It reduces ambiguity. It does not rely on bravado. It turns a suspicious interaction into a set of facts: what was sent, what was received, what changed, and why that matters.

That is the standard worth aiming for. Not dramatic probing. Not vague claims. Reproducible evidence, isolated execution and engineering output that can be acted on.

Need this kind of thinking applied to your own system?

Crescita helps businesses turn brittle workflows, slow APIs, messy documents and unclear technical architecture into dependable operational systems.

Talk to Crescita Back to articles

Adaptive Software as an Evolving Graph

Some software problems are better understood as adaptive networks: signals move, structures specialise, useful paths are reinforced and the system changes through feedback rather than static configuration.

Published by Crescita29 April 2026AI architecture

Not every intelligent system has to look like one large model call. Some problems are better approached as evolving networks of specialised relationships.

The common pattern in modern AI products is request in, model call, answer out. That is powerful, but it is also narrow. It treats intelligence as something concentrated inside a model rather than something that can emerge from the structure around it: memory, routing, feedback, tool use, context selection and adaptation over time.

An alternative way to think about adaptive software is as a graph. Nodes represent capabilities, memories, transformations, tools or concepts. Edges represent relationships, signal paths, trust, similarity, dependency or learned usefulness. Inputs move through the graph as signals. The system changes based on what happens to those signals.

This is not a claim that such a system is automatically conscious or human-like. It is a more modest engineering idea: adaptive behaviour can be modelled as structure plus feedback. The architecture remembers which paths were useful, which transformations produced value, which combinations failed, and where new specialisation is needed.

Learning is not only weight adjustment. At a systems level, learning can also be structural change.

A static pipeline decides its route in advance. An adaptive graph can choose a route based on the signal. A text input, image input, error log, business document or code change might activate different regions of the graph. Some nodes process. Some classify. Some transform. Some compare against memory. Some request help from tools. Some reinforce or weaken relationships based on validation.

The useful part of this idea is that it gives software a way to grow specialised internal structure. A node that repeatedly helps with a certain class of signal can become more central for that class. A path that repeatedly produces poor outputs can lose weight. Related nodes can cluster. New nodes can be created to handle recurring patterns that do not fit existing structures.

This resembles several ideas from complex systems without needing to copy biology literally. Biological metaphors are useful only when they force better engineering questions. What is the lifecycle of a node? How does a new capability inherit context from existing ones? When should a cluster split? When should it merge? What counts as feedback? What prevents uncontrolled growth?

A practical adaptive graph needs constraints. Without constraints, the system becomes noise with a memory leak. Useful constraints might include bounded node creation, explicit validation signals, decay for unused relationships, deterministic replay for experiments, and clear separation between hot memory, cold memory and external storage.

Storage matters because adaptive systems can become large quickly. If every signal creates permanent structure, the graph will drown in its own history. The system needs compression, pruning, summarisation and archival logic. It needs to distinguish between a one-off event, a useful pattern and a stable structural lesson.

Feedback also needs to be carefully defined. A human approving an answer is one type of feedback. A test passing is another. A tool call succeeding, a customer correcting an output, a downstream process accepting a result, or a route producing lower latency can all be feedback signals. The system should not treat them as the same thing.

This is where a graph architecture becomes interesting for practical AI tooling. Instead of relying only on prompt engineering, the system can build a memory of operational pathways. For a codebase, it might learn that certain modules, tests and deployment scripts belong together. For a document workflow, it might learn which files usually define project status. For an API platform, it might learn which logs, metrics and code paths are relevant to a class of incident.

The architecture can also remain model-agnostic. The graph does not have to be a replacement for language models, classifiers or search indexes. It can orchestrate them. A model can interpret a signal. A search index can retrieve context. A tool can act. The graph records the relationship between the situation, the selected route and the outcome.

The hardest part is not the idea. The hardest part is measurement. An adaptive system needs to prove that adaptation improves outcomes rather than merely creating complexity. That means controlled experiments, replayable datasets, baseline comparisons, bounded memory growth, and clear metrics for usefulness.

This kind of architecture sits somewhere between software engineering, graph theory, reinforcement learning and complex systems research. That is exactly why it is worth treating carefully. The concept is powerful, but it should be built through small empirical loops rather than grand claims.

The practical takeaway is simple: software systems can learn more than data. They can learn structure. They can learn routes. They can learn which tools belong together. They can learn where context should flow. If that learning is bounded, measurable and reversible, adaptive graph architecture becomes a serious design direction rather than just a metaphor.

Need this kind of thinking applied to your own system?

Crescita helps businesses turn brittle workflows, slow APIs, messy documents and unclear technical architecture into dependable operational systems.

Talk to Crescita Back to articles