Extending Coverage of a Lexicon of Discourse Connectives Using Annotation Projection

Jiří Mírovský, Pavlína Synková, Lucie Poláková


  1. Jiří Mírovský, Pavlína Jínová, and Lucie Poláková. Discourse Relations in the Prague Dependency Treebank 3.0. In The 25th International Conference on Computational Linguistics (Coling 2014), Proceedings of the Conference System Demonstrations, pages 34–38, Dublin City University (DCU), Dublin, Ireland, 2014.
  2. Jiří Mírovský, Lucie Poláková, and Jan Štěpánek. Searching in the Penn Discourse Treebank Using the PML-Tree Query. In Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016), pages 1762–1769, European Language Resources Association, Paris, France, 2016.
  3. Jiří Mírovský, Pavlína Synková, Magdaléna Rysová, and Lucie Poláková. CzeDLex – A Lexicon of Czech Discourse Connectives. The Prague Bulletin of Mathematical Linguistics, pages 61–91, Prague, Czech Republic, 2017.
  4. Petr Pajas and Jan Štěpánek. Recent Advances in a Feature-rich Framework for Treebank Annotation. In Proceedings of the 22nd International Conference on Computational Linguistics, pages 673–680, The Coling 2008 Organizing Committee, Manchester, 2008. (http://doi.org/10.3115/1599081.1599166)
  5. Petr Pajas and Jan Štěpánek. System for Querying Syntactically Annotated Corpora. In Proceedings of the ACL–IJCNLP 2009 Software Demonstrations, pages 33–36, Association for Computational Linguistics, Suntec, 2009. (http://doi.org/10.3115/1667872.1667881)
  6. Magdaléna Rysová and Kateřina Rysová. The Centre and Periphery of Discourse Connectives. In Proceedings of Pacific Asia Conference on Language, Information and Computing, pages 452–459, Bangkok, 2014.
  7. Magdaléna Rysová, Pavlína Synková, Jiří Mírovský, Eva Hajičová, Anna Nedoluzhko, Radek Ocelák, Jiří Pergler, Lucie Poláková, Veronika Pavlíková, Jana Zdeňková, and Šárka Zikánová. Prague Discourse Treebank 2.0, ÚFAL MFF UK, Prague, Czech Republic, 2016.
  8. Pavlína Synková, Lucie Poláková, Jiří Mírovský, and Magdaléna Rysová. CzeDLex 0.6, Data/Software, Charles University, ÚFAL MFF UK, Prague, Czech Republic, http://hdl.handle.net/11234/1-3074, 2019.
  9. Pavlína Synková, Magdaléna Rysová, Lucie Poláková, and Jiří Mírovský. Extracting a Lexicon of Discourse Connectives in Czech from an Annotated Corpus. In Proceedings of the 31st Pacific Asia Conference on Language, Information and Computation, pages 232–240, University of the Philippines Cebu, Cebu, Philippines, 2017.
  10. Ondřej Bojar, Jiří Mírovský, Kateřina Rysová, and Magdaléna Rysová. Evald Reference-less Discourse Evaluation for WMT18. In Proceedings of the Third Conference on Machine Translation: Shared Task Papers, pages 541–545, 2018. (http://doi.org/10.18653/v1/W18-6432)
  11. Antonio Briz, Salvador Pons Bordería, and José Portolés. Diccionario de partículas discursivas del español, Data/software, www.dpde.es. Online since 2003, 2003.
  12. Lynn Carlson, Mary Ellen Okurowski, Daniel Marcu, and others. RST Discourse Treebank, Linguistic Data Consortium, University of Pennsylvania, Philadelphia, 2002.
  13. Bruno Cartoni, S. Zufferey, and T. Meyer. Annotating the meaning of discourse connectives by looking at their translation: The translation-spotting technique. Dialogue & Discourse 4, pages 65–86, 2013. (http://doi.org/10.5087/dad.2013.204)
  14. Debopam Das, Tatjana Scheffler, Peter Bourgonje, and Manfred Stede. Constructing a Lexicon of English Discourse Connectives. In Proceedings of the 19th Annual SIGdial Meeting on Discourse and Dialogue, pages 360–365, 2018. (http://doi.org/10.18653/v1/W18-5042)
  15. Anna Feltracco, Elisabetta Jezek, Bernardo Magnini, and Manfred Stede. LICO: A Lexicon of Italian Connectives. CLiC-it, pages 141, 2016. (http://doi.org/10.4000/books.aaccademia.1770)
  16. Jan Hajič, Eva Hajičová, Jarmila Panevová, Petr Sgall, Ondřej Bojar, Silvie Cinková, Eva Fučíková, Marie Mikulová, Petr Pajas, Jan Popelka, and others. Announcing Prague Czech–English Dependency Treebank 2.0. In LREC, pages 3153–3160, 2012.
  17. Rebecca Hwa, Philip Resnik, Amy Weinberg, Clara Cabezas, and Okan Kolak. Bootstrapping Parsers via Syntactic Projection across Parallel Texts. Natural Language Engineering 11, pages 311–326, Cambridge, UK: Cambridge University Press, 2005. (http://doi.org/10.1017/S1351324905003840)
  18. Chloé Kiddon, Luke Zettlemoyer, and Yejin Choi. Globally Coherent Text Generation with Neural Checklist Models. In Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, pages 329–339, 2016. (http://doi.org/10.18653/v1/D16-1032)
  19. Michal Křen, Václav Cvrček, Tomáš Čapka, Anna Čermáková, Milena Hnátková, Lucie Chlumská, Tomáš Jelínek, Dominika Kováříková, Vladimír Petkevič, Pavel Procházka, and others. Korpus SYN, verze 8 z 12. 12. 2019, Praha: Ústav Českého národního korpusu FF UK. Available from http://www.korpus.cz, 2019.
  20. Majid Laali and Leila Kosseim. Improving Discourse Relation Projection to Build Discourse Annotated Corpora. arXiv preprint arXiv:1707.06357, 2017. (http://doi.org/10.26615/978-954-452-049-6_054)
  21. Jan Hajič, Eva Hajičová, Jarmila Panevová, Petr Sgall, Silvie Cinková, Eva Fučíková, Marie Mikulová, Petr Pajas, Jan Popelka, Jiří Semecký, Jana Šindlerová, Jan Štěpánek, Josef Toman, Zdeňka Urešová, and Zdeněk Žabokrtský. Prague Czech-English Dependency Treebank 2.0, Data/Software, Linguistic Data Consortium, University of Pennsylvania, Philadelphia. LDC2012T08, 2012.
  22. Rashmi Prasad, Bonnie Webber, Alan Lee, and Aravind Joshi. Penn Discourse Treebank Version 3.0, Data/Software, Linguistic Data Consortium, University of Pennsylvania, Philadelphia. LDC2019T05, 2019.
  23. Mitchell P. Marcus, Beatrice Santorini, and Mary Ann Marcinkiewicz. Treebank-2, Data/Software, Linguistic Data Consortium, University of Pennsylvania, Philadelphia. LDC95T7, 1995.
  24. Amália Mendes, Iria del Rio, Manfred Stede, and Felix Dombek. A Lexicon of Discourse Markers for Portuguese–LDM-PT. In 11th International Conference on Language Resources and Evaluation, pages 4379–4384, 2018.
  25. Thomas Meyer. Disambiguating temporal-contrastive connectives for machine translation. In Proceedings of the ACL 2011 Student Session, pages 46–51, Association for Computational Linguistics, Portland, OR, USA, 2011.
  26. Thomas Meyer and Bonnie Webber. Implicitation of Discourse Connectives in (Machine) Translation. In Proceedings of the Workshop on Discourse in Machine Translation, pages 19–26, 2013.
  27. Eleni Miltsakaki, Rashmi Prasad, Aravind Joshi, and Bonnie Webber. The Penn Discourse Treebank. In Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC'04), European Language Resources Association (ELRA), Lisbon, Portugal, 2004.
  28. Jiří Mírovský and Lucie Poláková. Sense Prediction for Explicit Discourse Relations with BERT. In Proceedings of Sixth International Congress on Information and Communication Technology (ICICT) 216, pages 835–842, Springer, Singapore, 2021.
  29. Sebastian Padó and Mirella Lapata. Cross-Lingual Annotation Projection for Semantic Roles. Journal of Artificial Intelligence Research 36, pages 307–340, 2009. (http://doi.org/10.1613/jair.2863)
  30. Rashmi Prasad, Nikhil Dinesh, Alan Lee, Eleni Miltsakaki, Livio Robaldo, Aravind Joshi, and Bonnie Webber. The Penn Discourse TreeBank 2.0. In Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08), pages 2961–2968, European Language Resources Association, Marrakech, 2008.
  31. Charlotte Roze, Laurence Danlos, and Philippe Muller. LEXCONN: A French Lexicon of Discourse Connectives. Discours. Revue de linguistique, psycholinguistique et informatique, Laboratoire LATTICE, UMR 8094 ENS/CNRS, 2012. (http://doi.org/10.4000/discours.8645)
  32. Kateřina Rysová, Magdaléna Rysová, and Jiří Mírovský. Automatic Evaluation of Surface Coherence in L2 Texts in Czech. In Proceedings of the 28th Conference on Computational Linguistics and Speech Processing (ROCLING 2016), pages 214–228, 2016.
  33. Tatjana Scheffler and Manfred Stede. Adding Semantic Relations to a Large-Coverage Connective Lexicon of German. In Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC'16), pages 1008–1013, European Language Resources Association (ELRA), Portorož, Slovenia, 2016.
  34. Henny Sluyter-Gäthje, Peter Bourgonje, and Manfred Stede. Shallow Discourse Parsing for Under-Resourced Languages: Combining Machine Translation and Annotation Projection. In Proceedings of The 12th Language Resources and Evaluation Conference, pages 1044–1050, 2020.
  35. Manfred Stede. DiMLex: A Lexical Approach to Discourse Markers. In Exploring the Lexicon – Theory and Computation, Alessandria (Italy): Edizioni dell'Orso, 2002.
  36. Manfred Stede, Tatjana Scheffler, and Amália Mendes. Connective-Lex: A Web-Based Multilingual Lexical Resource for Connectives. Discours. Revue de linguistique, psycholinguistique et informatique. A journal of linguistics, psycholinguistics and computational linguistics, Presses universitaires de Caen, 2019. (http://doi.org/10.4000/discours.10098)
  37. Peter D Turney and Michael L Littman. Measuring Praise and Criticism: Inference of Semantic Orientation from Association. ACM Transactions on Information Systems (TOIS) 21, pages 315–346, ACM New York, NY, USA, 2003. (http://doi.org/10.1145/944012.944013)
  38. Yannick Versley. Discovery of Ambiguous and Unambiguous Discourse Connectives via Annotation Projection. In Proceedings of Workshop on Annotation and Exploitation of Parallel Corpora (AEPC), pages 83–82, 2010.
  39. Hao Xiong, Zhongjun He, Hua Wu, and Haifeng Wang. Modeling Coherence for Discourse Neural Machine Translation. In Proceedings of the AAAI Conference on Artificial Intelligence 33, pages 7338–7345, 2019. (http://doi.org/10.1609/aaai.v33i01.33017338)
  40. Nianwen Xue, Hwee Tou Ng, Sameer Pradhan, Rashmi Prasad, Christopher Bryant, and Attapol Rutherford. The CoNLL-2015 Shared Task on Shallow Discourse Parsing. In Proceedings of the Nineteenth Conference on Computational Natural Language Learning-Shared Task, pages 1–16, 2015. (http://doi.org/10.18653/v1/K15-2001)
  41. Nianwen Xue, Hwee Tou Ng, Sameer Pradhan, Attapol Rutherford, Bonnie Webber, Chuan Wang, and Hongmin Wang. CoNLL 2016 Shared Task on Multilingual Shallow Discourse Parsing. In Proceedings of the CoNLL-16 shared task, pages 1–19, 2016. (http://doi.org/10.18653/v1/K16-2001)
  42. David Yarowsky and Grace Ngai. Inducing Multilingual POS Taggers and NP Bracketers via Robust Projection across Aligned Corpora. In Second Meeting of the North American Chapter of the Association for Computational Linguistics, 2001. (http://doi.org/10.3115/1073336.1073362)
  43. Renxian Zhang. Sentence Ordering Driven by Local and Global Coherence for Summary Generation. In Proceedings of the ACL 2011 Student Session, pages 6–11, 2011.