SOME APPROACHES FOR MAPPING WIKIPEDIA INFOBOX PROPERTIES TO WIKIDATA

Tạ Hoàng Thắng

Abstract


Wikidata is an open online database that stores the common resources shared by the other Wikimedia projects. Unifying Wikipedia infoboxes is part of Phase II of the Wikidata plan, which aims to bring automatic translation to Wikipedia infobox templates and to handle the diversity of infobox data across languages. In this paper, we present several approaches for mapping infobox properties to Wikidata in order to improve our enrichment model. Our results can serve as a valuable resource for aligning infobox properties with Wikidata. We focus mainly on mapping Vietnamese and English properties to Wikidata.

Keywords


DBpedia; Infobox Property; Mapping; Wikidata; Wikipedia.





DOI: http://dx.doi.org/10.37569/DalatUniversity.6.2.41(2016)



Copyright (c) 2016 Tạ Hoàng Thắng

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Editorial Office of DLU Journal of Science
Room.15, A25 Building, 01 Phu Dong Thien Vuong Street, Dalat, Lamdong
Email: tapchikhoahoc@dlu.edu.vn - Phone: (+84) 263 3 555 131
