Discourse Relations and Discourse Markers
This project brings together several lines of work in discourse studies:
(i) analysis of discourse relations (also called rhetorical relations or coherence relations), intra-sentential or inter-sentential, that relate two propositional arguments and are either explicitly marked by a connective or implicitly inferred from the context. (Funding: PORTULAN CLARIN)
Resources:
- CRPC-DB, a Portuguese discourse bank that applies the Penn Discourse Treebank (PDTB) style of annotation (Prasad et al., 2008) to a set of texts from the Reference Corpus of Contemporary Portuguese (CRPC).
- Portuguese subcorpus of TED-MDB, a parallel discourse bank annotated in the PDTB style. The corpus includes the English transcripts of six TED talks and their translations into German, Lithuanian, Polish, Portuguese, Russian and Turkish. This work was developed as part of CLUL's participation in the COST Action TextLink, Structuring Discourse in Multilingual Europe.
(ii) analysis of Portuguese discourse markers and their propositional, structural and pragmatic values.
Resources:
- LDM-PT, a lexicon of discourse markers for Portuguese.
- Integration of LDM-PT into Connective-Lex, a multilingual lexicon of discourse connectives.
Publications:
(2020). TED-MDB Lexicons: Tr-EnConnLex, Pt-EnConnLex. In Proceedings of the First Workshop on Computational Approaches to Discourse (Chloé Braud et al., Eds., pp. 148-153). Association for Computational Linguistics.
(2020). TED Multilingual Discourse Bank (TED-MDB): a parallel corpus annotated in the PDTB style. Language Resources and Evaluation, 54(2), 587-613. https://doi.org/10.1007/s10579-019-09445-9
(2020). Perguntas-respostas em textos escritos: uma análise no âmbito das relações discursivas. Revista da Associação Portuguesa de Linguística, 7, 226-241. http://doi.org/10.26334/2183-9077/rapln7ano2020a14
(2019). Connective-lex: A web-based multilingual lexical resource for connectives. Discours, 24.
(2018). Multilingual Extension of PDTB-Style Annotation: The Case of TED Multilingual Discourse Bank. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), Miyazaki, Japan, May 7-12, 2018. European Language Resources Association (ELRA).
(2018). Corref-PT: A Semi-Automatic Annotated Portuguese Coreference Corpus. Computación y Sistemas, 22(4). http://doi.org/10.13053/cys-22-4-3063
(2018). Discourse relations with explicit and implicit arguments: The case of European Portuguese aliás. In Proceedings of the Cross-Linguistic Discourse Annotation: Applications and Perspectives, Final Action Conference TextLink. Toulouse.
(2018). A Lexicon of Discourse Markers for Portuguese - LDM-PT. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), Miyazaki, Japan, May 2018. European Language Resources Association (ELRA).
(2018). Using a Discourse Bank and a Lexicon for the Automatic Identification of Discourse Connectives. In Computational Processing of the Portuguese Language, PROPOR 2018 (pp. 211-221). Springer International Publishing.
(2018). Designing a corpus-based lexicon for spoken DRDs: semantic considerations. In Proceedings of the Cross-Linguistic Discourse Annotation: Applications and Perspectives, Final Action Conference TextLink (L.M. Ho-Dac & Phillip Mueller, Eds., pp. 29-33). University of Toulouse.
(2016). Some considerations on the use of main verbs to express rhetorical relations. In L. Degand, C. Dér, P. Furkó, & B. Webber (Eds.), Conference Handbook of TextLink – Structuring Discourse in Multilingual Europe Second Action Conference (pp. 11-14). Budapest: Debrecen University Press, April 2016.