Grammar & Resources

The group is centered on modeling linguistic knowledge, integrating interfaces between different areas of grammar and knowledge about how language is put to use. Joint work in formal phonology, lexicon, syntax and semantics allows building an integrated model of grammar, considering how it is represented in the human mind, as well as how it can be computationally modelled; work on L1 and L2 acquisition is at the core of this work. The integration of models of language representation and models of language use is achieved through the study of corpora.

The production of corpora and resources is justified by the goal of developing documentation and providing descriptions of contemporary European Portuguese, but also of understudied contact languages or varieties (Portuguese-based creoles, national varieties of Portuguese in Africa and Asia). The group also produces resources for the study of L1 and L2 acquisition in different settings. The group integrates CLARIN LP.

Research on L1 and L2 acquisition contributes to CLUL’s general purpose of effectively articulating fundamental and applied research, namely in the areas of Educational Linguistics and Clinical Linguistics.

General goals:

- To produce new resources for the study of Portuguese and Portuguese-based creoles;

- To pursue basic research on natural language modeling, integrating knowledge on interfaces between language modules;

- To continue the documentation and description of understudied creoles and new varieties of Portuguese that emerged in a context of language contact;

- To develop the study of language acquisition with an emphasis on language contact situations (see new international Heritage Language Consortium) and on the comparison between typical and atypical development;

- To explore the potential of comparative linguistics in the production of resources for translation and to promote connections with the industry in the area of translation.

 

Resources Type
A Lexicon of Child European Portuguese - CEPLEXicon Lexicon
A Portuguese Native Language Identification Dataset - NLI-PT Database
Acquisition of European Portuguese Databank - AcEP Database
Child-Adult Interaction Corpus - CAI Corpus
Child-Adult interaction European Portuguese Database
Consonantic Sequences Oral and Written Production Tasks - PORESC Tool
Controlled Portuguese - CLG Database
Corpora of PLE Corpus
Corpus Almeida - European Portuguese / French Corpus
Corpus Angolar Corpus
Corpus C-ORAL-ROM Corpus
Corpus CCF Corpus
Corpus CINTIL Corpus
Corpus Fadambo Corpus
Corpus Leiria (1991) Corpus
Corpus of Cape Verdean Portuguese Corpus
Corpus of Sri Lanka Portuguese Corpus
Corpus of the Diaries of the Portuguese Parliament annotated with PoS - PTPARL Corpus
Corpus PESTRA Corpus
Corpus Português Fundamental - Corpus PF Corpus
Corpus Principense Corpus
Corpus REDIP Corpus
Corpus Santome Corpus
Corpus SANTOS - European Portuguese Corpus
Crosslinguistic Child Phonology Project - Português Europeu - CLCP-PE Tool
Dados Orais de Cabo Verde - CV Words Database
Demo de Subespecificação e Desambiguação de Escopo Tool
Dictionary of Hindi-Portuguese-Hindi Database
Diu Indo-Portuguese Data Set Database
Learner Corpus of Portuguese L2 - COPLE2 Corpus
LT Corpus (Literary Corpus) - LT Corpus Corpus
Modality Lexicon - MODAL-LEX-PT Lexicon
Multifunctional Computational Lexicon of Contemporary Portuguese Lexicon
Named Entity Recognizer - CRPC-NER Tool
Nominal Multiword Lexical Units in European Portuguese Lexicon
NPChunks: Corpus of 1000 sentences annotated with PoS and nominal chunks - NPChunks Corpus
Online Corpus of Writing and Speech of Children in the Early Years of Schooling - EFFE-On Corpus
Online Dictionary Portuguese-Slovak/Slovak-Portuguese Database
Pereira&Freitas - EP Corpus
Person-Machine Interaction in Natural Language - INQUER Database
PhonoDis Corpus
Phonological Awareness Tasks for First Grade School Children - TCFC Tool
Portuguese Biographies - Bio-PT Database
Portuguese Corpus Annotated for Modality - MODAL Corpus
Portuguese Lexicon of Discourse Markers - LDM-PT Lexicon
Portuguese Technical Lexica - LEXTEC Lexicon
Portuguese Discourse Bank - CRPC-DB Corpus
Quotations database - CRPC-quotations Database
Ramalho – EP Corpus
Reference Corpus of Contemporary Portuguese - CRPC Corpus
Santome Structure Dataset Database
Spoken Corpus Mozambique 1986-87 - SCM Corpus
Spoken Portuguese - Geographical and Social Varieties Corpus
Vocatives in European Portuguese Corpus
Word Combination in European Portuguese - LEX-MWE-PT Lexicon
WordNet.PT Lexicon
Capítulo de Livro
Costa, A. L., Costa, A., & Gonçalves, A. (2017). Conhecimento implícito, consciência linguística e escrita. In A aquisição de língua materna e não materna. Questões gerais e dados do Português (M. J. Freitas and A. L. Santos). Berlin: Language Science Press. Retrieved from http://langsci-press.org/
Costa, A. L., Costa, A., & Gonçalves, A. (2017). Consciência linguística: aspetos sintáticos. In Aquisição de Língua Materna e Não Materna. Questões Gerais e Dados do Português (M. J. Freitas and A. L. Santos, pp. 409-438). Questões Gerais e Dados do Português. Berlin: Language Science Press.
Costa, J., Freitas, M. J., & Gonçalves, A. (2016). Linguística Clínica: alguns dados sobre o português. In In A. M. Martins & E. Carrilho (eds.) Manual de Linguística Portuguesa (pp. 278-307). Berlin: De Gruyter.
Cyrino, S., & Matos, G. (2016). Null Objects and VP ellipsis in European and Brazilian Portuguese. In The Handbook of Portuguese Linguistics (Wetzels, L.; Menuzzi, S.; Costa, J, pp. 294-317). Malden, MA / Oxford: Wiley-Blackwell. http://doi.org/10.1002/9781118791844
Duarte, I., & Silva, F. (2016). The Null Subject Parameter and the Structure of the Clause in European and Brazilian Portuguese. In The Handbook of Portuguese Linguistics (Wetzels, Menuzzi and Costa, pp. 234-253). Malden/Oxford: Wiley Blackwell.
Duarte, I. (2015). Para a internacionalização da língua portuguesa: factos e políticas. In Pelos Mares da Língua Portuguesa 2 (A. M. Ferreira and M. F. Brasete, pp. 617-631). Aveiro: UA Editora.
Duarte, I., Santos, A. L., & Gonçalves, A. (2016). O infinitivo flexionado na gramática do adulto e na aquisição de L1. In Manual de Linguística Portuguesa (A. M. Martins & E. Carrilho, pp. 453-480). Berlin: Mouton De Gruyter.
-Gonçalves, F., & Freitas, M. J. (2016). Acquisition of Phonology. In L. Wetzels, Costa, J. A., & Menuzzi, S. (Eds.), The Handbook of Portuguese Linguistics (pp. 545-561). UK: Wiley Blackwell.
Freitas, M. J. (2016). A Sílaba na Gramática do Adulto e na Aquisição de Língua Materna. In Manual de Linguística Portuguesa (A. M. Martins and E. Carrilho, pp. 663-687). Berlin, Boston: MRL Series.
Freitas, M. J. (2017). Aquisição da fonologia em língua materna: a sílaba. In A aquisição de língua materna e não materna. Questões gerais e dados do Português (M. J. and A. L. Santos, pp. 71-94). Berlin: Language Science Press. Retrieved from http://langscipress.org/
Gonçalves, A., Carrilho, E., & Pereira, S. (2016). Predicados complexos numa perspetiva comparativa. In , in Manual de Linguística Portuguesa (A. M. Martins & E. Carrilho, pp. 523-557). Berlin/Boston: Mouton De Gruyter.
Hagemeijer, T. (2015). The Gulf of Guinea creoles: A case-study of syntactic reconstruction. In Perspectives on Historical Syntax (C. Viti, pp. 291-316). Amsterdam/Philadelphia: John Benjamins.
Hagemeijer, T. (2016). O português em contacto em África. In Manual de Linguística Portuguesa (A. M. Martins and E. Carrilho, pp. 43-67). Berlim: Mouton de Gruyter.
Lejeune, P., & Valentim, H. (2015). Étude contrastive de quelques valeurs de bien et bem. In Variation, ajustement, interprétation (D. Lebaud & C. Paulin, pp. 95-114). Besançon: Presses Universitaires de Franche-Comté.
Lejeune, P. (2016). Structures linguistiques et problèmes de traduction : schémas nominaux renvoyant à la transformation dans le discours de spécialité. In Linguistique et Traductologie (Catherine Delesse, Catherine Chauvin, Maryvonne Boisseau, Yvon Keromnes). Arras: Presses Universitaires d’Artois.
Lousada, M., Alves, D., & Freitas, M. J. (2017). Avaliação da produção oral. In A aquisição de língua materna e não materna (M. J. and A. L. Santos). Berlin: Language Science Press. Retrieved from http://langscipress.org/
Lousada, M., Alves, D., & Freitas, M. J. (2017). Avaliação linguística e contextos de desenvolvimento típico e atípico: aspetos fonéticos e fonológicos. In A aquisição de língua materna e não materna. Questões gerais e dados do Português. Berlin: Language Science Press (M. J. and A. L. Santos). Berlin: Language Science Press. Retrieved from http://langsci-press.org/
Marques, R., & de Oliveira, R. P. (2016). Mood and Modality. In The Handbook of Portuguese Linguistics (Catherine Delesse, Catherine Chauvin, Maryvonne Boisseau, Yvon Keromnes, Vol. 22, pp. 408-424). WileyBlackwell.
Marques, R. (2016). O modo conjuntivo. In Manual de linguística portuguesa (MRL 16) (Ana Maria Martins e Ernestina Carrilho, Vol. 23, pp. 610-635). Berlim: De Gruyter Mouton.
Marques, R., Silvano, P., Gonçalves, A., & Santos, A. L. (2015). Sequence of tenses in complementation structures: Lexical restrictions and effects on language acquisition. In Hispanic Linguistics at the Crossroads. Theoretical Linguistics. Language Acquisition and Language Contact (R. Klassen, J. M. Liceras & E. Valenzuela, pp. 69-88). Amsterdam/Philadelphia: John Benjamins.
Martins, A. M., Santos, A. L., & Duarte, I. (2017). Syntactic complexity in children with Autism Spectrum Disorder and Specific Language Impairment. In Language Processing and Disorders (L. Escobar and V. Torrens and T. Parodi). Newcastle: Cambridge Scholars Publishing.
Mata, A. I., & Moniz, H. (2016). Prosódia, variação e processamento automático. In Manual de Linguística Portuguesa. MRL Series (Ana Maria Martins and Ernestina Carrilho, pp. 116-155). .
Mata, A. I., Moniz, H., & Batista, F. (2016). Stylistic variation in the intonation of European Portuguese teenagers and adults. In Intonational grammar in Ibero-Romance: Approaches across linguistic subfields (Meghan E. Armstrong and Nicholas Henriksen and Maria del Mar Vanrell, pp. 45-68). Amsterdam/ Philadelphia: John Benjamins.
Matos, G. (2017). TP Ellipsis with polarity particles. In Romance Languages and Linguistic Theory 12 (R. E. V. Lopes and J. O. Avelar and S. Cyrino, pp. 143-158). Amsterdam/Philadelphia: John Benjamins Publishing. http://doi.org/https://doi.org/10.1075/rllt.12
Matzenauer, C., & Costa, T. (2017). Aquisição da fonologia em língua materna: os segmentos. In A aquisição de língua materna e não materna. Questões gerais e dados do Português (M. J. and A. L. Santos). Berlin: Language Science Press. Retrieved from http://langsci-press.org/
Mendes, A., & Antunes, S. (2016). Collocations in Portuguese: A corpus-based approach to lexical patterns. In B. V. Sanromán (Ed.), Collocations Cross-Linguistically. Corpora, Dictionaries and Language Teaching (pp. 141-166). Sanromán, Helsinki: Société Néophilologique.
Mendes, A. (2016). Linguística de Corpus e outros usos do corpus em Linguística. In Manual de Linguística Portuguesa. (Ana Martins, Ernestina Carrilho (eds.), Vol. 16, pp. 224-251). Berlin/Boston: Walter de Gruyter.
Mendes, A., Estrela, A., Bacelar do Nascimento, M. F., Pereira, L., & Antunes, S. (2016). New words, old suffixes: Nominal derivation in the African varieties of Portuguese compared to European Portuguese. In R. Muhr, Duarte, E., Mendes, A., Negre, A., & Thomas, J. A. (Eds.), Pluricentric Languages and Non-Dominant Varieties Worldwide: Volume 2: The pluricentricity of Portuguese and Spanish: New concepts and descriptions (pp. 121-136). Wien/Frankfurt: Peter Lang Verlag.
Mendes, A., Duarte, I., Bacelar do Nascimento, M. F., Pereira, L., & Estrela, A. (2015). Pronominal constructions and subject indetermination in national varieties of Portuguese – A global view on norms. In R. Muhr & Marley, D. (Eds.), Pluricentric Languages. New Perspectives in Theory and Description (pp. 109-125). Frankfurt/Wien: Peter Lang Verlag.
Moniz, H., Batista, B., Mata, A. I., & Trancoso, I. (2016). Towards automatic language processing and intonational labeling in European Portuguese. In Intonational grammar in IberoRomance: Approaches across linguistic subfields (Meghan E. Armstrong and Nicholas Henriksen and Maria del Mar Vanrell, pp. 227-248). Amsterdam/ Philadelphia: John Benjamins.