Grammar & Resources
The group is centered on modeling linguistic knowledge, integrating interfaces between different areas of grammar and knowledge about how language is put to use. Joint work in formal phonology, lexicon, syntax and semantics allows building an integrated model of grammar, considering how it is represented in the human mind, as well as how it can be computationally modelled; work on L1 and L2 acquisition is at the core of this work. The integration of models of language representation and models of language use is achieved through the study of corpora.
The production of corpora and resources is justified by the goal of developing documentation and providing descriptions of contemporary European Portuguese, but also of understudied contact languages or varieties (Portuguese-based creoles, national varieties of Portuguese in Africa and Asia). The group also produces resources for the study of L1 and L2 acquisition in different settings. The group integrates CLARIN LP.
Research on L1 and L2 acquisition contributes to CLUL’s general purpose of effectively articulating fundamental and applied research, namely in the areas of Educational Linguistics and Clinical Linguistics.
General goals:
- To produce new resources for the study of Portuguese and Portuguese-based creoles;
- To pursue basic research on natural language modeling, integrating knowledge on interfaces between language modules;
- To continue the documentation and description of understudied creoles and new varieties of Portuguese that emerged in a context of language contact;
- To develop the study of language acquisition with an emphasis on language contact situations (see new international Heritage Language Consortium) and on the comparison between typical and atypical development;
- To explore the potential of comparative linguistics in the production of resources for translation and to promote connections with the industry in the area of translation.
Membros
Integrated members with PhD
Integrated members without PhD
Colaboradores
Concluded
Project | Date | Fin. |
---|---|---|
ParlaMint II - ParlaMint II | - | |
PALMA - Possession and Location: Microvariation in African Varieties of Portuguese (PALMA) | - | FCT
|
RECAP - RECAP: Resources for Portuguese Learning | - | FCG
|
CLARIN - CLARIN | - | |
Documentation of Sri Lanka Portuguese | - | |
LeCIEPLE - LeCIEPLE - Learner Corpus: da investigação ao ensino de Português Língua Estrangeira/Língua Segunda | - | FCG
|
Portuguese-based creoles of the Dravidian space: Diachrony and synchrony | - | |
TAXE - TAXE - Parataxis, Hypotaxis and Interface Syntax-Discourse | - | |
COPAS - COPAS - Contrast and Parallelism in Speech | - | |
CLAP - CLAP - Complement clauses in the Acquisition of Portuguese | FCT
|
|
SemiAutLex.PT - SemiAutLex.PT - Semi-automatic construction of relational lexica for Portuguese | - | FCT
|
SynExtract - SynExtract - automatic extraction of synonymy relations for a cost-effective acquisition of language resources | - | FCT
|
(2000). Novos dados acerca de /#øS$C/. In Actas do XV Encontro Nacional da Associação Portuguesa de Linguística (pp. 287-299). Faro: APL. . |
(2000). Espaço acústico das vogais acentuadas de Braga. In Actas do XV Encontro Nacional da Associação Portuguesa de Linguística (pp. 301-315). Faro: APL. . |
(1999). Das escolas e das culturas: história de uma sequência consonântica. In Actas do XIV Encontro Nacional da Associação Portuguesa de Linguística (Volume II, pp. 117-133). Aveiro: APL. . |
(1999). CPE VAR (Corpus de Português Europeu - Variação). In Poster in Actas do XIV Encontro Nacional da Associação Portuguesa de Linguística (Vol. II, pp. 627-629). Aveiro: APL. . |
(1994). Nova proposta de datação de três manuscritos medievais. In Actas do IX Encontro Nacional da Associação Portuguesa de Linguística (pp. 363-376). Coimbra: APL - Colibri. . |
(2020). Infrastructure for the Science and Technology of Language PORTULAN CLARIN. In LREC 2020 Worskhop IWLTP 2020 – 1st International Workshop on Language Technology Platforms (pp. 1-7). ELRA. . |
(2004). The vowel [ɨ] in the acquisition of European Portuguese. In J. van Kampen & Baauw, S. (Eds.), GALA 2003 (pp. 163-174). Utrecht: LOT. . |
(2005). Parataxe como coordenação e justaposição – evidência a partir de um caso de elipse. In Actas do XX Encontro Nacional da Associação Portuguesa de Linguística (Duarte, I.; Leiria, I. , pp. 687-699). Lisboa: Associação Portuguesa de Linguistica. Retrieved from https://apl.pt/wp-content/uploads/2017/12/2004-55.pdf . |
(2005). Construções contrastivas de focalização: adversativas vs. concessivas. In Actas do XX Encontro Nacional da Associação Portuguesa de Linguística (Duarte, I.; Leiria, I.). Retrieved from https://apl.pt/wp-content/uploads/2017/12/2004-56.pdf . |
(2004). Coordenação Frásica vs. Subordinação Adverbial. In Actas do XIX Encontro Nacional da Associação Portuguesa de Linguística (Freitas, T.; Mendes, A., pp. 555-567). Lisboa: Associação Portuguesa de Linguistica. Retrieved from https://apl.pt/wp-content/uploads/2017/12/2003-45.pdf . |
(1999). Competitive information sources in referential ambiguity resolution. In Psycholinguistics on the Threshold of the Year 2000 — Proceedings of 5th International Congress of the International Society of Applied Pshycholinguistics (ISAPL 97) (Pinto, M. G.; Veloso, J.; Maia, B., pp. 133-138). Porto: Faculdade de Letras da Universidade do Porto. Retrieved from https://apl.pt/wp-content/uploads/2017/12/1997-16.pdf . |
(1995). Estruturas Binárias e Monocêntricas em Sintaxe — algumas observações sobre a coordenação de projecções máximas. In Actas do X Encontro Nacional da Associação Portuguesa de Linguística, 1994 (pp. 301-315). Évora: Edições Colibri, APL. Retrieved from https://apl.pt/wp-content/uploads/2017/12/1994-23.pdf . |
(1998). Ambiguidade referencial na identificação do sujeito em estruturas coordenadas. In Actas do XIII Encontro Nacional da Associação Portuguesa de Linguística, 1997 (Mota, M.A,; Marquilhas, R. , pp. 173-188). Lisboa: Edições Colibri / APL . Retrieved from https://apl.pt/wp-content/uploads/2017/12/1997-16.pdf . |
(1997). Functional Categories in Early Acquisition of European Portuguese. In Proceedings of Gala' 97 Conference on Language Acquisition (Sorace, A.; Heycock, C.; Shillcock, R., pp. 115-120). . |
(1996). A Sintaxe e a Morfo-Sintaxe nas Gramáticas Descritivas do Século XX. In Actas do XI Encontro Nacional da Associação Portuguesa de Linguística, 1995 (Duarte, I.; Miguel, M. , pp. 105-121). Lisboa: Edições Colibri / APL. Retrieved from https://apl.pt/wp-content/uploads/2017/12/1995-10-2.pdf . |
(1989). Elipse do SV em estruturas predicativas com ser e estar. In Actas do IV Encontro Nacional da Associação Portuguesa de Linguística (pp. 41-67). Lisboa: Reprografia da Associação de Estudantes da Faculdade de Letras de Lisboa . Retrieved from https://apl.pt/wp-content/uploads/2017/12/1988-5.pdf . |
(2020). TED-MDB Lexicons: Tr-EnConnLex, Pt-EnConnLex. In Proceedings of the First Workshop on Computational Approaches to Discourse (Chloé Braud et al., Eds., pp. 148-153). Association for Computational Linguistics. . |
(2018). Designing a corpus-based lexicon for spoken DRDs: semantic considerations. In Proceedings of the Cross-Linguistic Discourse Annotation: Applications and Perspectives, Final Action Conference TextLink (L.M. Ho-Dac & Phillip Mueller, Eds., pp. 29-33). University of Toulouse. . |
(2004). The acquisition of the Prosodic Word in European Portuguese. In Second Lisbon Meeting on Language Acquisition. Lisboa. . |
(2022). The PALMA Corpora of African Varieties of Portuguese. In N. Calzolari, Béchet, F., Blache, P., Choukri, K., Declerck, T., Goggi, S., et al. (Eds.), Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022) (Marseille, 20-25 June 2022. Paris: European Language Resources Association (ELRA), pp. 5047-5053). . |
(1996). Aspectos fonéticos do Barlavento do Algarve: as vogais finais acentuadas. In I. Duarte & Leiria, I. (Eds.), Actas do Congresso Internacional sobre o Português Vol. II (1994) (pp. 345-358). Lisboa: APL e Eds Colibri. . |
(2023). "Otraves" o mesmo "faitico": a proficiência ortográfica nos dígrafos e de crianças alentejanas e transmontanas do 2.º ano de escolaridade. In C. Amorim & Zhou, C. (Eds.), Atas do II Phonoshuttle OPO-LIS: Ponte aérea de fonologia (pp. 53-62). Retrieved from https://ler.letras.up.pt/uploads/ficheiros/19671.pdf . |
(2024). Compiling and Exploring a Portuguese Parliamentary Corpus - ParlaMint-PT. In D. Fiser, Eskevich, M., & Gordon, D. (Eds.), Proceedings of the IV Workshop on Creating, Analysing, and Increasing Accessibility of Parliamentary Corpora (ParlaCLARIN) @ LREC-COLING 2024 (pp. 12-20). Torino, Italy: ELRA and ICCL. . |
(2024). Investigating the Generalizability of Portuguese Readability Assessment Models Trained Using Linguistic Complexity Features. In P. Gamallo, Claro, D., Teixeira, A., Real, L., Garcia, M., Oliveira, H., & Amaro, R. (Eds.), Proceedings of the 16th International Conference on Computational Processing of Portuguese - PROPOR 2024 (pp. 332-341). "Santiago de Compostela, Galicia/Spain": Association for Computational Lingustics. . |
(2024). Multiple Discourse Relations in English TED Talks and Their Translation into Lithuanian, Portuguese and Turkish. In P. Zweigenbaum, Rapp, R., & Sharoff, S. (Eds.), Proceedings of the 17th Workshop on Building and Using Comparable Corpora (BUCC) @ LREC-COLING 2024 (pp. 125-134). Torino, Italia: ELRA and ICCL. . |
(2020). Query Strategies, Assemble! Active Learning with Expert Advice for Low-resource Natural Language Processing. 2020 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE). IEEE. http://doi.org/10.1109/fuzz48607.2020.9177707 . |
(2021). PALMA Corpus São Tomé e Príncipe . Lisboa: Centro de Linguística da Universidade de Lisboa. . |
(2021). PALMA Corpus Moçambique. Lisboa: Centro de Linguística da Universidade de Lisboa. . |
(2021). PALMA Corpus Angola. Lisboa: Centro de Linguística da Universidade de Lisboa. . |
(2022). A casa na quinta: das palavras às frases. Lisboa: Direção Geral de Educação. Retrieved from https://redge.dge.mec.pt/ilha/por4/ . |