Towards the Automatic Learning of Ontologies
- Resource Type
- Conference
- Authors
- Ocampo-Guzman, Isidra; Lopez-Arevalo, Ivan; Tello-Leal, Edgar; Sosa-Sosa, Victor
- Source
- 2009 Seventh Brazilian Symposium in Information and Human Language Technology (STIL), pp. 191-197, Sep 2009
- Subject
- Computing and Processing
Communication, Networking and Broadcast Technologies
General Topics for Engineers
Ontologies
Visualization
Humans
Information management
Content addressable storage
Ontology construction
Latent Dirichlet Allocation
WordNet
- Language
This paper proposes a methodology for the automatic learning of ontologies from a text corpus. The concepts (topics) in the documents of the corpus are identified using the Latent Dirichlet Allocation model. For each identified topic, a taxonomy is constructed from the highest-probability terms that define it. WordNet is used in the construction of these partial topic taxonomies to obtain the similarity and relatedness between the terms that constitute each topic. The resulting taxonomies are then joined to form the final ontology. The methodology is evaluated on the Lonely Planet corpus.
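The pipeline described in the abstract can be illustrated with a minimal, self-contained sketch. It is not the authors' implementation. It assumes the topic-term probabilities have already been produced by some LDA implementation, and it replaces WordNet with a tiny hand-written hypernym map. The names `TOPIC_TERMS`, `HYPERNYM`, `top_terms`, `path_similarity`, `topic_taxonomy`, `merge_taxonomies`, and `build_ontology` are all hypothetical, the numbers are toy values, and the path-based score is only one possible similarity measure; the abstract does not say which WordNet measures the paper uses.

```python
from collections import defaultdict

# ASSUMPTION: toy topic-term probabilities standing in for the output of an
# LDA model. These values are illustrative, not results from the paper.
TOPIC_TERMS = {
    "topic_0": {"hotel": 0.30, "hostel": 0.25, "room": 0.20, "price": 0.05},
    "topic_1": {"beach": 0.35, "island": 0.30, "hotel": 0.10, "boat": 0.05},
}

# ASSUMPTION: toy child -> parent map standing in for WordNet hypernym links.
# It is hand-written for this sketch and is not the real WordNet hierarchy.
HYPERNYM = {
    "hotel": "building",
    "hostel": "building",
    "building": "structure",
    "room": "area",
    "area": "structure",
    "structure": "object",
    "beach": "landform",
    "island": "landform",
    "landform": "object",
}


def top_terms(term_probs, k):
    """Return the k most probable terms (ties broken alphabetically)."""
    ranked = sorted(term_probs.items(), key=lambda kv: (-kv[1], kv[0]))
    return [term for term, _ in ranked[:k]]


def hypernym_path(term):
    """Return the chain from a term up to its root in the toy hierarchy."""
    path = [term]
    while path[-1] in HYPERNYM:
        path.append(HYPERNYM[path[-1]])
    return path


def path_similarity(a, b):
    """Score 1 / (1 + number of edges between a and b); 0.0 if unconnected."""
    path_a, path_b = hypernym_path(a), hypernym_path(b)
    depth_b = {node: i for i, node in enumerate(path_b)}
    for i, node in enumerate(path_a):
        if node in depth_b:  # first shared ancestor is the lowest one
            return 1.0 / (1 + i + depth_b[node])
    return 0.0


def topic_taxonomy(terms):
    """Build parent -> children edges from each term's hypernym path."""
    edges = defaultdict(set)
    for term in terms:
        path = hypernym_path(term)
        for child, parent in zip(path, path[1:]):
            edges[parent].add(child)
    return edges


def merge_taxonomies(taxonomies):
    """Join partial topic taxonomies by taking the union of their edges."""
    merged = defaultdict(set)
    for taxonomy in taxonomies:
        for parent, children in taxonomy.items():
            merged[parent] |= children
    return merged


def build_ontology(topic_terms, k=3):
    """Run the three steps: top terms, per-topic taxonomy, merge."""
    return merge_taxonomies(
        topic_taxonomy(top_terms(probs, k)) for probs in topic_terms.values()
    )


if __name__ == "__main__":
    ontology = build_ontology(TOPIC_TERMS, k=3)
    for parent in sorted(ontology):
        print(parent, "->", sorted(ontology[parent]))
```

In this sketch the taxonomies are joined by a plain union of parent-child edges, so a term such as "hotel" that appears in two topics ends up as a single node. A fuller version would obtain the topic-term distributions from a trained LDA model and query the real WordNet hierarchy instead of the toy map.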