Skip to content
Citations Logic Citations Logic
Blog About Request Access

The Infrastructure Bet Hidden Inside the AI Copyright Crisis

Alain Scemama
5 min read

AI systems can generate answers at scale. What they cannot do is prove it.

Prove what was used. When. Under which rights. And to whom compensation is owed.

That gap — between what AI systems do and what they can account for — is no longer a theoretical concern. It is becoming a legal, regulatory, and commercial liability for every company building on top of generative AI. And it points to something larger: the next infrastructure layer that AI actually needs, and that the industry has largely failed to build.


The Altman Signal

Sam Altman's reference to micropayments as a mechanism for AI agents to compensate publishers was not a throwaway line. It was an infrastructure bet dressed as a product comment.

The logic behind it is straightforward. If AI agents are going to pay for knowledge — and the direction of regulation, litigation, and commercial pressure makes this increasingly inevitable — then the industry doesn't just need better models. It needs plumbing.

Micropayments at machine scale require a full stack: identity, authorization, metering, pricing, settlement, auditability, and rights enforcement — operating in real time, across millions of simultaneous interactions. That stack does not yet exist as a coherent layer. There are pieces — payment rails, OAuth-style authorization protocols, blockchain settlement experiments — but no integrated infrastructure purpose-built for the way AI actually consumes content.

This is not unprecedented. Fintech went through the same structural moment: powerful use cases blocked not by lack of demand but by missing plumbing. Before APIs matured, before OAuth standardized authorization, before settlement rails could handle transaction volume at speed, the infrastructure gap was the business problem. The solution was not a better product. It was a new layer.


Pressure From Every Direction

The urgency is no longer abstract. Three converging forces are creating a deadline.

Regulatory: EU AI Act obligations for General Purpose AI systems begin applying in August 2026. Among the requirements: training data transparency, copyright compliance documentation, and — for providers above certain thresholds — auditable records of what content was used and under what rights. Most AI companies are not positioned to comply. Not because they lack legal teams, but because the underlying systems were never built to generate that record.

Litigation: Copyright cases against major AI developers are now reaching systemic scale. The questions being litigated — what was ingested, when, under which license or exception, and what compensation is owed — are infrastructure questions as much as legal ones. Courts are beginning to ask for evidence that most defendants cannot produce, not because it was destroyed, but because it was never captured.

Commercial: Publishers, news organizations, and content platforms are increasingly unwilling to accept opaque licensing arrangements. The market is moving toward verifiable usage records as a condition of access. The companies that can demonstrate auditable rights compliance will have access to content others cannot reach.

These three pressures are not sequential. They are simultaneous — and they are converging on the same core question: Can AI companies demonstrate what was used, when, how, and under which rights?

Most cannot. Not yet.


The instinct is to treat this as a compliance issue — hire a DPO, write better terms of service, negotiate blanket licenses. That instinct is understandable and wrong.

The compliance layer cannot be retrofitted onto systems that were never designed to generate the underlying records. You cannot audit what was never metered. You cannot compensate rights holders whose content was never identified. You cannot demonstrate traceability where no trace was kept.

This is the same reason early internet companies couldn't simply "add security" as an afterthought. Security wasn't a feature to be bolted on — it required architectural decisions made at the infrastructure level. Rights accountability for AI is the same class of problem.

The solution is not legal coverage. It is infrastructure: a rights-aware layer that sits between AI systems and the content they access, capable of metering usage, attributing sources, triggering micropayment settlement, and generating auditable compliance records — at the speed and scale at which AI actually operates.


What That Layer Looks Like

The components are knowable, even if the integrated stack doesn't yet exist:

  • Identity and attribution — which content, from which rights holder, accessed when

  • Usage metering — granular tracking at the level of individual interactions, not batch estimates

  • Rights verification — real-time checks against license status, opt-out registries, and TDM exceptions

  • Micropayment settlement — programmable compensation triggered by verified usage events, not periodic lump-sum negotiations

  • Auditability — immutable, timestamped records capable of satisfying regulatory and judicial scrutiny

None of these components are technically exotic. What is missing is the coherent layer that connects them, designed specifically for the consumption patterns of generative AI — not for human browsing, not for traditional SaaS API calls, but for the high-frequency, multi-source, often simultaneous content access that large language models and AI agents actually perform.


The Infrastructure Bet

The next competitive layer in AI will not be won by the model with the most parameters or the dataset with the most tokens. Those advantages are compressing. The models are converging.

What will differentiate the next generation of AI deployments is not raw capability. It is accountability. The ability to demonstrate, at any point, what was used, under what rights, with what compensation flowing to whom.

That accountability is not achievable through better contracts or smarter legal arguments. It requires infrastructure — a rights-aware layer built for machine scale, designed to make AI usage traceable, auditable, measurable, and compensable.

That is the infrastructure gap we are building to close at Citations LLC.

The question for every AI company deploying at scale — and for every publisher, regulator, and investor watching this space — is whether they will be caught flat-footed when that gap closes, or whether they will have already built the layer that comes next.