DOI: https://doi.org/10.11649/cs.1811

Lexical platform – the first step towards user-centred integration of lexical resources

Maciej Piasecki, Tomasz Walkowiak, Ewa Rudnicka, Francis Bond

Abstract


The paper describes the Lexical Platform, a means for lightweight integration of independent lexical resources. Lexical resources (LRs) are represented as web components that implement a minimal set of predefined programming interfaces. These interfaces provide querying functionality and generate a simple, common presentation format. As a result, no common data format is needed and the identity of each component LR is preserved. Users can search, browse and navigate across resources on the basis of a limited set of anchor elements, such as base form, word form and synset id.
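The integration idea described in the abstract can be illustrated with a minimal sketch. All names here (`Anchor`, `LexicalResource`, `query`, `render`, `search`, and the two toy resources with their data) are hypothetical and are not the Lexical Platform's actual API. The sketch assumes only what the abstract states: each resource implements a small common interface for querying and presentation, keeps its own internal data format, and is reached through anchor elements such as base form, word form and synset id.

```python
from dataclasses import dataclass
from typing import Dict, List, Protocol


@dataclass(frozen=True)
class Anchor:
    """An anchor element shared across resources (hypothetical model)."""
    kind: str   # "base_form", "word_form" or "synset_id"
    value: str


class LexicalResource(Protocol):
    """Assumed minimal interface that every component resource implements."""
    name: str

    def query(self, anchor: Anchor) -> List[dict]: ...
    def render(self, entry: dict) -> str: ...


class ToyWordnet:
    """Toy resource storing synsets in its own internal format."""
    name = "toy-wordnet"
    _synsets = {"s1": {"lemmas": ["dog", "hound"], "gloss": "a domestic canine"}}

    def query(self, anchor: Anchor) -> List[dict]:
        if anchor.kind == "synset_id":
            hit = self._synsets.get(anchor.value)
            return [{"id": anchor.value, **hit}] if hit else []
        if anchor.kind == "base_form":
            return [{"id": sid, **s} for sid, s in self._synsets.items()
                    if anchor.value in s["lemmas"]]
        return []  # this resource does not index inflected word forms

    def render(self, entry: dict) -> str:
        return f"{entry['id']}: {', '.join(entry['lemmas'])} ({entry['gloss']})"


class ToyInflectionDictionary:
    """Toy resource with a different internal format (lemma -> word forms)."""
    name = "toy-inflection"
    _forms = {"dog": ["dog", "dogs"]}

    def query(self, anchor: Anchor) -> List[dict]:
        if anchor.kind == "base_form":
            forms = self._forms.get(anchor.value)
            return [{"lemma": anchor.value, "forms": forms}] if forms else []
        if anchor.kind == "word_form":
            return [{"lemma": lemma, "forms": forms}
                    for lemma, forms in self._forms.items()
                    if anchor.value in forms]
        return []  # synset ids are unknown to this resource

    def render(self, entry: dict) -> str:
        return f"{entry['lemma']}: {' '.join(entry['forms'])}"


def search(resources: List[LexicalResource], anchor: Anchor) -> Dict[str, List[str]]:
    """Query every resource with the anchor and collect rendered hits by resource name."""
    results: Dict[str, List[str]] = {}
    for resource in resources:
        rendered = [resource.render(entry) for entry in resource.query(anchor)]
        if rendered:
            results[resource.name] = rendered
    return results
```

Calling `search([ToyWordnet(), ToyInflectionDictionary()], Anchor("base_form", "dog"))` returns rendered entries from both toy resources. Neither resource had to convert its data to a shared schema; only the anchor and the rendered output are common.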

 



Keywords


lexical resources; wordnet; interoperability of lexical resources


Copyright (c) 2018 Maciej Piasecki, Tomasz Walkowiak, Ewa Rudnicka, Francis Bond

License URL: http://creativecommons.org/licenses/by/3.0/pl/