A modular framework for ontology learning from text in Portuguese

Authors

N. C. Guimarães, C. L. de Carvalho

DOI:

https://doi.org/10.33837/msj.v3i3.899

Keywords:

Semiautomatic Ontology Learning, Public Security Ontology, Natural Language Processing, Taxonomic relations.

Abstract

Research on ontology learning has been carried out in many knowledge areas, especially in Artificial Intelligence. Semi-automatic or automatic ontology learning can contribute to the field of knowledge representation. Many semi-automatic approaches to ontology learning from texts have been proposed, and most of them use natural language processing techniques. This paper describes the construction of a computational framework for semi-automatic ontology learning from texts in Portuguese. Axioms are not treated in this paper. The work builds on Philipp Cimiano's proposal, combined with text standardization mechanisms, natural language processing, identification of taxonomic relations, and techniques for structuring ontologies. A case study in the public security domain was also carried out, showing the benefits of the developed framework. The result of this case study is an ontology for that domain.

References

Baségio, T. L. & de Lima, V. L. S. (2006). Semi-automatically building ontological structures from Portuguese written texts. In Vieira, R., Quaresma, P., Nunes, M. d. G. V., Mamede, N. J., Oliveira, C., and Dias, M. C., editors, Computational Processing of the Portuguese Language, volume 3960 of Lecture Notes in Computer Science, 208–211. Springer Berlin Heidelberg.

Brank, J., Grobelnik, M., and Mladenic, D. (2005). A survey of ontology evaluation techniques. In Proc. of 8th Int. multi-conf. Information Society, 166–169.

Cao, Y., Wang, X., Zhang, F., and Yang, W. (2012). Ontology-based domain knowledge acquisition technology. In Computational Intelligence and Design (ISCID), 2012 Fifth International Symposium on, 2, 487–490.

Cimiano, P. (2006). Ontology Learning and Population from Text: Algorithms, Evaluation and Applications. Springer-Verlag New York, Inc., Secaucus, NJ, USA.

Fayad, M. E., Schmidt, D. C., and Johnson, R. E. (1999). Implementing Application Frameworks: Object-oriented Frameworks at Work. John Wiley & Sons, Inc., New York, NY, USA.

Gamma, E., Helm, R., Johnson, R., and Vlissides, J. (1995). Design Patterns: Elements of Reusable Object-oriented Software. Addison-Wesley Longman Publishing Co., Inc., Boston, MA, USA.

Ghisi, F. B., Fachin, G. R. B., Santos, M. H. d., Sell, D., and Rados, G. J. V. (2012). A reference ontology for digital scientific journals applied to systematic literature review processes. Transinformação, 24, 91–101.

Gonçalves, G., Wilkens, R., and Villavicencio, A. (2011). Sistema de aquisição semi-automatica de ontologias. In Vieira, R., Guizzardi, G., and Fiorini, S. R., editors, ONTOBRAS-MOST, volume 776 of CEUR Workshop Proceedings, 189–194. CEUR-WS.org.

Gruber, T. (1995). Toward principles for the design of ontologies used for knowledge sharing. International Journal of Human-Computer Studies, 43(5-6), 907–928.

Hearst, M. A. (1992). Automatic acquisition of hyponyms from large text corpora. In Proceedings of the 14th Conference on Computational Linguistics - Volume 2, COLING’92, 539–545, Stroudsburg, PA, USA. Association for Computational Linguistics.

Jena (2014). The Apache Jena project. Version 2.12, 2014. Available at: http://jena.apache.org/. Last accessed in December 2014.

Junior, L. C. R. (2008). Ontolp: Construção semi-automática de ontologias a partir de textos da língua portuguesa. (Dissertação de Mestrado em Computação Aplicada) – Universidade do Vale do Rio dos Sinos, São Leopoldo. Available at: http://goo.gl/LFutcI. Last accessed in April 2013.

Lopes, L., Fernandes, P., Vieira, R., and Fedrizzi, G. (2009). Exatolp - an automatic tool for term extraction from Portuguese language corpora. In Proceedings of the LTC’09, Poznań, Poland.

Lopes, L., Vieira, R., Fernandes, P., and Couto, G. (2012). Exatolp - an automatic tool for term extraction from Portuguese language corpora. In International Conference on Computational Processing of the Portuguese Language - PROPOR, 45–47.

Lowagie, B. (2010). iText in Action. Manning Publications Co., Greenwich, CT, USA.

Maedche, A. & Staab, S. (2001). Ontology learning for the semantic web. IEEE Intelligent Systems, 16(2), 72–79.

Moraes, S., and Lima, V. (2012). Combining formal concept analysis and semantic information for building ontological structures from texts: an exploratory study. In Calzolari, N., Choukri, K., Declerck, T., Doğan, M. U., Maegaard, B., Mariani, J., Odijk, J., and Piperidis, S., editors, Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC-2012), pages 3653–3660, Istanbul, Turkey. European Language Resources Association (ELRA). ACL Anthology Identifier: L12-1556.

Motta, E. N. (2009). Preenchimento semi-automático de ontologias de domínio a partir de textos em língua portuguesa. (Dissertação de Mestrado em Informática) – Centro de Ciências exatas em tecnologia, Universidade Estadual do Rio de Janeiro, Rio de Janeiro. Available at: <http://www2.uniriotec.br/ppgi/banco-de-dissertacoes-ppgi-unirio/ano-2009/preenchimento-semi-automatico-de-ontologias-de-dominio-a-partir-de-textos-em-lingua-portuguesa/view>. Last accessed in April 2014.

OpenNLP (2010). Toolkit for the processing of natural language text. Version 1.5, 2010. Available at: http://opennlp.apache.org. Last accessed in November 2014.

Press, O. U. (2010). Oxford American Desk Dictionary & Thesaurus. Oxford University Press, USA.

Surhone, L., Tennoe, M., & Henssonow, S. (2010). Apache Poi. Betascript Publishing.

Wong, W., Liu, W., & Bennamoun, M. (2012). Ontology learning from text: A look back and into the future. ACM Comput. Surv., 44(4), 20:1–20:36.

Zahra, F. M. (2009). Poronto - ferramenta para construção semiautomática de ontologias em português. (Dissertação de Mestrado em Tecnologia em Saúde) – CCBS, PUCPR, Curitiba. Available at: http://www.dominiopublico.gov.br/. Last accessed in April 2014.

Published

2020-10-19

How to Cite

Guimarães, N. C., & de Carvalho, C. L. (2020). A modular framework for ontology learning from text in Portuguese. Multi-Science Journal (ISSN 2359-6902), 3(3), 37-42. https://doi.org/10.33837/msj.v3i3.899

Section

Other Areas of Knowledge