Prague to Penn Discourse Transformation

Jiří Mírovský, Magdaléna Rysová, Pavlína Synková, Lucie Poláková

References:

  1. Jan Hajič, Eduard Bejček, Alevtina Bémová, Eva Buráňová, Eva Fučíková, Eva Hajičová, Jiří Havelka, Jaroslava Hlaváčová, Petr Homola, Pavel Ircing, Jiří Kárník, Václava Kettnerová, Natalia Klyueva, Veronika Kolářová, Lucie Kučová, Markéta Lopatková, David Mareček, Marie Mikulová, Jiří Mírovský, Anna Nedoluzhko, Michal Novák, Petr Pajas, Jarmila Panevová, Nino Peterek, Lucie Poláková, Martin Popel, Jan Popelka, Jan Romportl, Magdaléna Rysová, Jiří Semecký, Petr Sgall, Johanka Spoustová, Milan Straka, Pavel Straňák, Pavlína Synková, Magda Ševčíková, Jana Šindlerová, Jan Štěpánek, Barbora Štěpánková, Josef Toman, Zdeňka Urešová, Barbora Vidová Hladká, Daniel Zeman, Šárka Zikánová, and Zdeněk Žabokrtský. Prague Dependency Treebank - Consolidated 1.0 (PDT-C 1.0), LINDAT/CLARIAH-CZ digital library, Czech Republic, 2020.
  2. Jiří Mírovský, Pavlína Jínová, and Lucie Poláková. Discourse Relations in the Prague Dependency Treebank 3.0 In The 25th International Conference on Computational Linguistics (Coling 2014), Proceedings of the Conference System Demonstrations, pages 34–38, Dublin City University (DCU), Dublin, Ireland, 2014.
  3. Jiří Mírovský and Lucie Poláková. Sense Prediction for Explicit Discourse Relations with BERT In Proceedings of Sixth International Congress on Information and Communication Technology (ICICT) 216, pages 835–842, Springer, Singapore, 2021.
  4. Jiří Mírovský, Pavlína Synková, Lucie Poláková, Věra Kloudová, and Magdaléna Rysová. CzeDLex 1.0, Charles University, Prague, Czech Republic, 2021.
  5. Petr Pajas and Jan Štěpánek. Recent Advances in a Feature-rich Framework for Treebank Annotation In Proceedings of the 22nd International Conference on Computational Linguistics, pages 673–680, The Coling 2008 Organizing Committee, Manchester, 2008. (http://doi.org/10.3115/1599081.1599166)
  6. Lucie Poláková, Pavlína Jínová, Šárka Zikánová, Zuzanna Bedřichová, Jiří Mírovský, Magdaléna Rysová, Jana Zdeňková, Veronika Pavlíková, and Eva Hajičová. Manual for Annotation of Discourse Relations in Prague Dependency Treebank, pages 1–83, Institute of Formal and Applied Linguistics, Charles University, Prague, Czech Republic, 2012.
  7. Lucie Poláková, Jiří Mírovský, Anna Nedoluzhko, Pavlína Jínová, Šárka Zikánová, and Eva Hajičová. Introducing the Prague Discourse Treebank 1.0 In Proceedings of the Sixth International Joint Conference on Natural Language Processing, pages 91–99, Asian Federation of Natural Language Processing, Nagoya, 2013.
  8. Lucie Poláková and Pavlína Synková. Pragmatické aspekty v popisu textové koherence Naše řeč 104, pages 225–242, Praha, Česká republika, 2021.
  9. Magdaléna Rysová and Kateřina Rysová. The Centre and Periphery of Discourse Connectives In Proceedings of Pacific Asia Conference on Language, Information and Computing, pages 452–459, Bangkok, 2014.
  10. Magdaléna Rysová and Kateřina Rysová. Primary and secondary discourse connectives: Constraints and preferences Journal of Pragmatics 130, pages 16–32, 2018. (http://doi.org/10.1016/j.pragma.2018.03.013)
  11. Magdaléna Rysová, Pavlína Synková, Jiří Mírovský, Eva Hajičová, Anna Nedoluzhko, Radek Ocelák, Jiří Pergler, Lucie Poláková, Veronika Pavlíková, Jana Zdeňková, and Šárka Zikánová. Prague Discourse Treebank 2.0, ÚFAL MFF UK, Prague, Czech Republic, 2016.
  12. Pavlína Synková, Magdaléna Rysová, Jiří Mírovský, Lucie Poláková, Veronika Sheller, Jana Zdeňková, Šárka Zikánová, and Eva Hajičová. Prague Discourse Treebank 3.0, LINDAT/CLARIAH-CZ digital library, Prague, Czech Republic, 2022.
  13. Šárka Zikánová, Eva Hajičová, Barbora Hladká, Pavlína Jínová, Jiří Mírovský, Anna Nedoluzhko, Lucie Poláková, Kateřina Rysová, Magdaléna Rysová, and Jan Václ. Discourse and Coherence. From the Sentence Structure to Relations in Text, ÚFAL, Praha, Czechia, 2015.
  14. Šárka Zikánová, Pavlína Synková, and Jiří Mírovský. Enriched Discourse Annotation of PDiT Subset 1.0 (PDiT-EDA 1.0), Charles University, Prague, Czech Republic, 2018.
  15. Ondřej Bojar, Jiří Mírovský, Kateřina Rysová, and Magdaléna Rysová. Evald Reference-less Discourse Evaluation for WMT18 In Proceedings of the Third Conference on Machine Translation: Shared Task Papers, pages 541–545, 2018. (http://doi.org/10.18653/v1/W18-6432)
  16. Laurence Danlos, Diégo Antolinos-Basso, Chloé Braud, and Charlotte Roze. Vers le FDTB: French Discourse Tree Bank In TALN 2012: 19ème conférence sur le Traitement Automatique des Langues Naturelles, pages 471–478, 2012.
  17. Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. Bert: Pre-training of Deep Bidirectional Transformers for Language Understanding Proceedings of the 2019 Conference of NAACL: Human Language Technologies, pages 4171–4186, 2019.
  18. Chloé Kiddon, Luke Zettlemoyer, and Yejin Choi. Globally Coherent Text Generation with Neural Checklist Models In Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, pages 329–339, 2016. (http://doi.org/10.18653/v1/D16-1032)
  19. Jan Hajič, Eva Hajičová, Jarmila Panevová, Petr Sgall, Silvie Cinková, Eva Fučíková, Marie Mikulová, Petr Pajas, Jan Popelka, Jiří Semecký, Jana Šindlerová, Jan Štěpánek, Josef Toman, Zdeňka Urešová, and Zdeněk Žabokrtský. Prague Czech-English Dependency Treebank 2.0, Data/Software, Linguistic Data Consortium, University of Pennsylvania, Philadelphia. LDC2012T08, 2012.
  20. Rashmi Prasad, Bonnie Webber, Alan Lee, and Aravind Joshi. Penn Discourse Treebank Version 3.0, Data/Software, Linguistic Data Consortium, University of Pennsylvania, Philadelphia. LDC2019T05, 2019.
  21. Alan Lee, Rashmi Prasad, Bonnie Webber, and Aravind Joshi. Annotating Discourse Relations with the PDTB Annotator In Proceedings of COLING 2016, The 26th International Conference on Computational Linguistics: System Demonstrations, pages 121–125, 2016.
  22. William C. Mann and Sandra A. Thompson. Rhetorical Structure Theory: Toward a Functional Theory of Text Organization Text 8, pages 243–281, 1988. (http://doi.org/10.1515/text.1.1988.8.3.243)
  23. Thomas Meyer and Bonnie Webber. Implicitation of Discourse Connectives in (Machine) Translation In Proceedings of the Workshop on Discourse in Machine Translation, pages 19–26, 2013.
  24. Marie Mikulová, Alevtina Bémová, Jan Hajič, Eva Hajičová, Jiří Havelka, Veronika Kolářová Lucie Kučová, Markéta Lopatková, Petr Pajas, Jarmila Panevová, Magda Razímová, Petr Sgall, Jan Štěpánek, Zdeňka Urešová, Kateřina Veselá, and Zdeněk Žabokrtský. Annotation on the Tectogrammatical Layer in the Prague Dependency Treebank, 2005.
  25. Jiří Mírovský, Pavlína Jínová, and Lucie Poláková. Does Tectogrammatics Help the Annotation of Discourse? In Proceedings of COLING 2012: Posters, pages 853–862, 2012.
  26. Umangi Oza, Rashmi Prasad, Sudheer Kolachina, Dipti Misra Sharma, and Aravind Joshi. The Hindi Discourse Relation Bank In Proceedings of the third Linguistic Annotation Workshop, pages 158–161, 2009. (http://doi.org/10.3115/1698381.1698410)
  27. Rashmi Prasad, Nikhil Dinesh, Alan Lee, Eleni Miltsakaki, Livio Robaldo, Aravind Joshi, and Bonnie Webber. The Penn Discourse TreeBank 2.0 In Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08), pages 2961–2968, European Language Resources Association, Marrakech, 2008.
  28. Kateřina Rysová, Magdaléna Rysová, and Jiří Mírovsk. Automatic Evaluation of Surface Coherence in L2 Texts in Czech In Proceedings of the 28th Conference on Computational Linguistics and Speech Processing (ROCLING 2016), pages 214–228, 2016.
  29. Wei Shi and Vera Demberg. Next sentence prediction helps implicit discourse relation classification within and across domains In Proceedings of EMNLP-IJCNLP 2019, pages 5794–5800, 2019. (http://doi.org/10.18653/v1/D19-1586)
  30. Maite Taboada and William C Mann. Rhetorical Structure Theory: Looking Back and Moving Ahead Discourse studies 8, pages 423–459, SAGE publications London, Thousand Oaks, CA and New Delhi, 2006. (http://doi.org/10.1177/1461445606061881)
  31. Peter D Turney and Michael L Littman. Measuring Praise and Criticism: Inference of Semantic Orientation from Association ACM Transactions on Information Systems (TOIS) 21, pages 315–346, ACM New York, NY, USA, 2003. (http://doi.org/10.1145/944012.944013)
  32. Bonnie Webber, Matthew Stone, Aravind Joshi, and Alistair Knott. Anaphora and Discourse Structure Computational Linguistics 29, pages 545–587, MIT Press, 2003. (http://doi.org/10.1162/089120103322753347)
  33. Bonnie Webber, Rashmi Prasad, Alan Lee, and Aravind Joshi. The Penn Discourse Treebank 3.0 Annotation Manual Philadelphia, University of Pennsylvania 35, pages 108, 2019.
  34. Hao Xiong, Zhongjun He, Hua Wu, and Haifeng Wang. Modeling Coherence for Discourse Neural Machine Translation In Proceedings of the AAAI Conference on Artificial Intelligence 33, pages 7338–7345, 2019. (http://doi.org/10.1609/aaai.v33i01.33017338)
  35. Nianwen Xue, Hwee Tou Ng, Sameer Pradhan, Rashmi Prasad, Christopher Bryant, and Attapol Rutherford. The CoNLL-2015 Shared Task on Shallow Discourse Parsing In Proceedings of the Nineteenth Conference on Computational Natural Language Learning-Shared Task, pages 1–16, 2015. (http://doi.org/10.18653/v1/K15-2001)
  36. Nianwen Xue, Hwee Tou Ng, Sameer Pradhan, Attapol Rutherford, Bonnie Webber, Chuan Wang, and Hongmin Wang. CoNLL 2016 Shared Task on Multilingual Shallow Discourse Parsing In Proc. of the CoNLL-16 shared task, pages 1–19, 2016. (http://doi.org/10.18653/v1/K16-2001)
  37. Deniz Zeyrek and Murathan Kurfalı. TDB 1.1: Extensions on Turkish Discourse Bank LAW XI 2017, pages 76, 2017. (http://doi.org/10.18653/v1/W17-0809)
  38. Renxian Zhang. Sentence Ordering Driven by Local and Global Coherence for Summary Generation In Proceedings of the ACL 2011 Student Session, pages 6–11, 2011.
  39. Yuping Zhou and Nianwen Xue. PDTB-style discourse annotation of Chinese text In Proceedings of the 50th Annual Meeting of the ACL: Long Papers-Volume 1, pages 69–77, 2012.