Opendata-Benchmark-Ita

Component description / functionalities

Dataset-OpenData-Benchmark-ITA is a multiple-choice benchmark dataset designed to evaluate the capability of Large Language Models (LLMs) to understand, retrieve, and reason over public Open Data published by European government portals. The current release focuses exclusively on Italian Open Data and is based on datasets published on the official Italian government portal, data.gov.it. Future releases will extend the benchmark to include harmonized governmental Open Data from additional European countries, starting with France, Spain, and Germany.

IPCEI CIS Reference Architecture

AI Layer

Data Layer

Open source license

ODC-BY-1.0

Keywords

open-data-benchmark

italian-benchmark-dataset

multiple-choice-qa

benchmark-dataset

llm-evaluation

italian-open-data

data-gov-it

public-sector-ai

government-open-data

dataset-understanding

metadata-reasoning

retrieval-based-qa

knowledge-benchmark

structured-qa

question-answering-dataset

italian-language-dataset

italian-nlp

european-open-data

real-world-data-benchmark

document-grounding

data-metadata-understanding

tabular-data-qa

csv-metadata-pairs

reasoning-benchmark

llm-testing

evaluation-suite

supervised-benchmark

curated-dataset

odc-by-license

villanova-ai

applied-nlp-benchmark

Visit the repository