Complex Semantic Tabular Interpretation using SDD-Gen

No Thumbnail Available
Authors
Johnson, Matthew
Stingone, Jeanette
Bengoa, Sofia
Masters, James
McGuinness, Deborah L.
Issue Date
2024-03-24
Type
Article
Language
en_US
Keywords
Research Projects
Organizational Units
Journal Issue
Alternative Title
Abstract
Knowledge graphs have become an essential technology for both businesses and governments. They enable a wide variety of critical tasks, such as aligning diverse datasets, improving the capabilities of search engines, supporting error checking, and generating explanations using inference engines. However, populating, augmenting, and/or validating a knowledge graph can be challenging because developers need domain knowledge to understand their data and experience in ontology modeling to align concepts properly as well as experience with conflict detection and truth maintenance tools. Previous efforts have explored automatically integrating simple tabular data into knowledge graphs to lower the barrier to entry. These methods heavily rely on named entity overlap and require that tables are similar to relational tables in third normal form. While these methods have been successful under competition, these limitations make them impractical for general usage. In this paper, we introduce the semantic data dictionary generator (SDD-Gen), an algorithm that aligns complex tabular data to ontological terms for knowledge graph generation. Our methodology leverages context information from data dictionaries to make alignments, enabling us to align complex tables with few named entities and multiple subject columns.
Description
Full Citation
Johnson M, Stingone JA, Bengoa S, Masters J, and McGuinness DL. “Complex Semantic Tabular Interpretation using SDD-Gen,” in IEEE 18th International Conference on Semantic Computing (ICSC). IEEE, 2024.
Publisher
Institute of Electrical and Electronics Engineers (IEEE)
Journal
Volume
Issue
PubMed ID
DOI
ISSN
EISSN