This repository contains the open-source evaluation toolkit released alongside the paper "From Rules to Neural Graphs: Scalable Structured Prediction for Patent Prior Art Search".
## Repository layout

```
invention_graph_eval/      # Evaluation library (pip-installable)
├── graph_model.py         # Graph/Node pydantic models
├── human_readable.py      # ASCII graph rendering
├── metrics.py             # SetMetrics, FeatureGraphMetrics
├── loader.py              # Dataset loading utilities
└── tokenize.py            # Word-level tokenizer
data/
├── evaluation.tar.gz      # 5,000-patent evaluation set (gold-standard)
└── train_sample.tar.gz    # 5,000-patent training sample
tests/                     # Unit tests (no data required)
evaluate.py                # CLI evaluation script
requirements.txt
pyproject.toml
```
## Installation

```
pip install -e ".[dev]"
# or: pip install -r requirements.txt
```

## Data

The datasets are included as compressed archives in `data/`:
```
cd data
tar xzf evaluation.tar.gz
tar xzf train_sample.tar.gz
```

## Running the tests

```
python -m pytest tests/ -v

# Validate extracted data files
python -m pytest tests/test_data_validation.py --data-dir data/evaluation -v
```

## Evaluation

```
python evaluate.py \
    --testset data/evaluation \
    --predictions my_predictions/ \
    --output report.csv
```

Both `--testset` and `--predictions` are directories with the same layout:
```
<dir>/
    patent_meta.csv
    json/<ucid>.json
```
If the testset also contains a `targets/` directory with word-level CSV labels, word coverage metrics are included in the output.
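To be scored, predictions must be written into that same layout. A minimal sketch of doing so, assuming dict-based graphs in the JSON schema described below (the helper `write_predictions` is illustrative, not part of the toolkit):

```python
import json
from pathlib import Path

def write_predictions(pred_dir: str, graphs: dict, meta_csv: str) -> None:
    """Write predicted graphs into the directory layout expected by evaluate.py.

    `graphs` maps ucid -> graph dict; the testset's patent_meta.csv is copied
    so both directories describe the same patents.
    """
    root = Path(pred_dir)
    (root / "json").mkdir(parents=True, exist_ok=True)
    (root / "patent_meta.csv").write_text(Path(meta_csv).read_text())
    for ucid, graph in graphs.items():
        # One JSON file per patent, named by its ucid
        (root / "json" / f"{ucid}.json").write_text(json.dumps(graph, indent=2))
```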
## Dataset format

```
<dataset_root>/
    patent_meta.csv         # metadata per patent
    json/<ucid>.json        # gold-standard graph (JSON)
    targets/<ucid>.csv      # word-level token labels
```

### patent_meta.csv
| Column | Description |
|---|---|
| `ucid` | Unique patent identifier |
| `id` | Internal integer ID |
| `family_id` | Patent family ID |
| `country` | 2-letter country code |
| `ipc_codes` | IPC codes (pipe-separated) |
| `coverage` | Fraction of claim words in the graph |
| `length` | Number of tokens in the patent claim |
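The `coverage` column can be understood as a set ratio over distinct claim words. A simplified stand-in (the real tokenization lives in `invention_graph_eval/tokenize.py` and may differ):

```python
import re

def coverage(claim_text: str, graph_words: set) -> float:
    """Fraction of distinct claim words that appear in the graph.

    A sketch only: uses a naive \\w+ tokenizer and lowercasing,
    which is an assumption about the toolkit's word-level tokenizer.
    """
    tokens = {t.lower() for t in re.findall(r"\w+", claim_text)}
    if not tokens:
        return 0.0
    return len(tokens & {w.lower() for w in graph_words}) / len(tokens)
```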
### targets/&lt;ucid&gt;.csv

| Column | Description |
|---|---|
| `id` | Token index |
| `text` | Token text |
| `parent` | Index of the parent token in the feature tree |
| `edge` | Edge type: `none`, `normal`, `meronym`, `hyponym`, `syntactic`, `method`, `reference` |
| `relation` | Index of the relation root token (0 if none) |
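Based on the column table above, the feature-tree edges can be read out of a targets CSV with the standard library; a sketch (`read_token_edges` is illustrative, and the toolkit's own loader may handle these files differently):

```python
import csv
import io

def read_token_edges(csv_text: str):
    """Extract (child_id, parent_id, edge_type) triples from a word-level
    targets CSV, skipping tokens whose edge type is "none"."""
    edges = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        if row["edge"] != "none":
            edges.append((int(row["id"]), int(row["parent"]), row["edge"]))
    return edges
```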
### json/&lt;ucid&gt;.json

The graph JSON follows the pydantic schema in `invention_graph_eval/graph_model.py`:
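For orientation, a graph in this schema can be flattened with a small stdlib traversal (a sketch; `walk` is illustrative and the pydantic models in `graph_model.py` remain the authoritative definitions):

```python
def walk(graph: dict):
    """Flatten the nested graph JSON into feature and relation lists.

    Feature nodes carry a string value; relation nodes carry a list
    mixing {"ref": id} and {"text": "..."} parts.
    """
    features, relations = [], []
    stack = list(graph.get("items", []))
    while stack:
        node = stack.pop()
        if node["type"] == "feature":
            features.append((node["id"], node["value"]))
        elif node["type"] == "relation":
            relations.append((node["id"], node["value"]))
        stack.extend(node.get("items", []))  # recurse into nested items
    return features, relations
```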
```json
{
  "items": [
    {
      "id": 1,
      "type": "feature",
      "value": "pump",
      "items": [
        {
          "id": 2,
          "type": "relation",
          "value": [
            {"ref": 1},
            {"text": "connects to"},
            {"ref": 3}
          ],
          "items": []
        }
      ]
    }
  ]
}
```

## Metrics

| Metric | Description |
|---|---|
| `whole graph iou` | Mean IoU of the edge sets (predicted vs. gold) |
| `whole graph iou@100` | Fraction of patents with perfect IoU (= 1.0) |
| `whole graph iou@90` | Fraction of patents with IoU ≥ 0.9 |
| `whole graph p` | Mean precision of edge sets |
| `whole graph r` | Mean recall of edge sets |
| `targets coverage` | Fraction of input-text vocabulary in the gold-standard graph |
| `predictions coverage` | Fraction of input-text vocabulary in the predicted graph |
| `targets graph depth` | Mean branch depth of gold-standard graphs |
| `targets relation count` | Mean number of relation nodes in gold-standard graphs |
The primary benchmark metric is `whole graph iou`.
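The set comparison behind the `whole graph` metrics can be sketched as follows (a simplified stand-in; the toolkit's `SetMetrics` may represent edges and handle empty sets differently):

```python
def edge_set_scores(pred: set, gold: set):
    """IoU, precision, and recall over two edge sets.

    Edges are assumed hashable (e.g. (source, target) tuples);
    that representation is an illustration, not the toolkit's.
    """
    inter = len(pred & gold)
    union = len(pred | gold)
    iou = inter / union if union else 1.0  # two empty graphs match perfectly
    p = inter / len(pred) if pred else 0.0
    r = inter / len(gold) if gold else 0.0
    return iou, p, r
```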
## Python API

```python
from invention_graph_eval.loader import load_dataset
from invention_graph_eval.metrics import FeatureGraphMetrics
from invention_graph_eval.graph_model import Graph, GraphWithNodeMap

# Load the evaluation set
meta, target_graphs, input_texts = load_dataset("data/evaluation")

# Build your predictions (list of GraphWithNodeMap, same order as meta)
# ...

metrics = FeatureGraphMetrics(predictions, target_graphs, input_texts)
print(metrics.stats())
```

## License

See LICENSE.