Opendata-Benchmark-Ita

Component description / functionalities

Dataset-OpenData-Benchmark-ITA is a multiple-choice benchmark dataset designed to evaluate the capability of Large Language Models (LLMs) to understand, retrieve, and reason over public Open Data published by European government portals. The current release focuses exclusively on Italian Open Data and is based on datasets published on the official Italian government portal, data.gov.it. Future releases will extend the benchmark to include harmonized governmental Open Data from additional European countries, starting with France, Spain, and Germany. 

IPCEI CIS Reference Architecture

AI Layer
Data Layer

Open source license

ODC-BY-1.0
Keywords
open-data-benchmark
italian-benchmark-dataset
multiple-choice-qa
benchmark-dataset
llm-evaluation
italian-open-data
data-gov-it
public-sector-ai
government-open-data
dataset-understanding
metadata-reasoning
retrieval-based-qa
knowledge-benchmark
structured-qa
question-answering-dataset
italian-language-dataset
italian-nlp
european-open-data
real-world-data-benchmark
document-grounding
data-metadata-understanding
tabular-data-qa
csv-metadata-pairs
reasoning-benchmark
llm-testing
evaluation-suite
supervised-benchmark
curated-dataset
odc-by-license
villanova-ai
applied-nlp-benchmark