italiainnovation. \\ polibio.ai v0.1 · tools & methods
An Italia Innovation tools & methods release Companion paper · SSRN, forthcoming

Agentic AI for field research and inductive theory in the social sciences.

An open template for building a domain-specific agentic research system on Claude Code. It iterates hypotheses, builds datasets to the design, prepares fieldwork, and probes findings with theory and quantitative evidence — it is scaffolding you fill with your field's commitments, sources, and canon.

Scope
Sociology · political economy · economic history · law & economics
Built on
Claude Code · MCP · DuckDB · public APIs
License
MIT · proposed
_ why

Two failures of general-purpose AI for empirical research.

General-purpose models apply every theoretical tradition with equal weight, and they leave intact the oldest constraint of empirical work: the data dictates the study.

01

General-purpose AI flattens the field.

Models re-derive the analytical apparatus from scratch each time, weighting traditions equally. A discipline's commitments, moves, and canon never enter the loop — and so never shape the work.

02

The available data dictates the study.

Without machinery to build evidence to a design, the researcher bends the question to whatever statistics happen to exist. The study becomes a function of the dataset, not the inquiry.

Polibio.ai inverts both.

field operations in the system datasets built to the design
_ lifecycle

Four phases, in order.

Discover and gather before interpreting. Theory before statistics. Fieldwork between C and D — not in the system, but in the researcher's hands.

A
discovery

Iterate hypotheses

A conjecture meets a networked evidence base — APIs and MCP — and is revised in minutes, not weeks.

B
design

Build datasets to the design

The study specifies variables, units, and period; the system assembles them across sources with full provenance.

C
interpretation

Prepare fieldwork

Triangulate preliminary evidence with literature — your library first — and the documentary record into an operational plan.

fieldworkyours.
D
probing

Probe with theory, then evidence

Apply your field's frameworks first, then quantitative tests. The system returns a diagnosis, never a verdict.

theory precedes statistics · within phase D diagnosis, not verdict · consistent · inconsistent · inconclusive · surprising
_ what it is

Scaffolding you fill.

Polibio.ai finds, builds, prepares, and probes. What it discloses is scaffolding; what you bring is your field — its commitments, moves, sources, and canon.

Disclosed: the released scaffolding.

Four analytical layers and their agents, the slash commands, the data toolkit, the provenance-logged DataPool, and the CLAUDE.md skeleton.

Mutable: what you bring.

Your field's intellectual commitments, its analytical moves, its evidence sources, and its canon — the "interlocutor library." These are the three sections of CLAUDE.md you rewrite.

solid · released dashed · you bring
L1Discovery agents · slash commandsscaffolding
L2Data toolkit · DataPool · provenancescaffolding
L3Interpretation & probing layersscaffolding
— —Intellectual commitmentsyour field
— —Analytical movesyour field
— —Evidence sources & canonyour field
_ data toolkit

Provenance-first.

A set of connectors to common social-science sources. Every observation traces to the call that produced it. Usable standalone, even without Claude Code.

FRED· Eurostat· OECD· World Bank· BLS· US Census· QCEW· ISTAT· NOMIS· ONS· Companies House· Land Registry· Charity Commission· INE· Banco de España· Catastro· EPO patents· DART

The DataPool guarantee

Everything lands in a local DuckDB DataPool whose schema enforces — by design — the conditions for empirical work that can be audited and reproduced.

  • Every observation traces to a logged retrieval.
  • No imputation without a recorded method.
  • Raw values are never overwritten.
  • Source conflicts are preserved, not resolved away.
_ what it produces

From conjecture to diagnosis.

The phases leave a trail of auditable artifacts — provenance-logged datasets, charts, a probe diagnosis. Below is the output of one worked example: Italy's industrial districts in the AI–robotics transition, the case the companion paper develops. The phases turned a lagging-adoption story into a question about absorption — and built the evidence to ask it well.

Manufacturing TFP corridors% per year · Penn World Table 11.0, ARDECO
The adoption paradoxItaly as % of benchmark, 100 = parity · IFR / OWID, Eurostat
The frontier-patent inversionshare of frontier robotics, CPC B25J · EPO
The district employment paradox2012→2023, % change · ISTAT, ASIA–BvD
Worked example · 16 probes · 18 data connectors · every series provenance-logged
Phase D returns adiagnosis, not a verdict.
consistent inconsistent inconclusive surprising
_ adapt it to your field

Where the approach travels.

The pattern is general. The included example — institutional law & economics — is one instantiation, not the contribution. Three conditions apply.

01 · structure

Decomposable operations.

Your field's analysis separates into discrete moves that chain. An agent can be given one move at a time and held to it.

02 · evidence

Accessible evidence.

Your sources are queryable — or at least machine-readable. If the data is in a building, no system substitutes for getting on a plane.

03 · order

A sequenceable epistemology.

There is a defensible order in which the work proceeds — discovery before interpretation, theory before statistics — and you can name it.

_ read · cite

Two doors in.

A paper and a repository. Pick the one that matches how you read systems.

paper

Read the companion paper.

Polibio.ai — Agentic AI for Field Research and Inductive Theory in the Social Sciences.

VenueSSRN
StatusForthcoming
AuthorsItalia Innovation
Read on SSRN
code

Get the repository.

The four-layer scaffolding, the slash commands, the data toolkit, the DataPool, and the CLAUDE.md skeleton. Fork and fill.

Repogithub.com/italiainnovation/polibio
LicenseMIT · proposed
CiteCITATION.cff
Open on GitHub