Preprocessing the Norwegian Web as Corpus (NoWaC) in R

The present script can be used to pre-process data from a frequency list of the Norwegian as Web Corpus (NoWaC). Before using the script, the frequency list should be downloaded from The list is described as ‘frequency list sorted primary alphabetic and secondary by frequency within each character’, and the direct URL is: The download requires signing in to an institutional network. Last, the downloaded file should be unzipped.

Discussion of Labotka et al. (2023)

Slideshare Labotka, D., Sabo, E., Bonais, R., Gelman, S. A., & Baptista, M. (2023). Testing the effects of congruence in adult multilingual acquisition with implications for creole genesis. Cognition, 235, 105387.

Pronominal object clitics in preverbal position are a hard nut to crack for Google Translate

Some Romance languages allow the movement of pronominal object clitics to the preverbal position (Hanson & Carlson, 2014; Labotka et al., 2023). That is, instead of saying La maestra lo ha detto (Italian) ‘The teacher has said it’, it is possible to say Lo ha detto la maestra ‘It has said the teacher’. The latter is a marked phrasing that increases the attention to the subject of the sentence. Furthermore, when the clitic is in preverbal position, the degree of focus on the subject is also dependent on the context.

Discussion of Jost et al. (2019)

Slideshare Jost, E., Brill-Schuetz, K., Morgan-Short, K., & Christiansen, M. H. (2019). Input complexity affects long-term retention of statistically learned regularities in an artificial language learning task. Frontiers in Human Neuroscience, 13, 358.

More refined typology and design in linguistic relativity: The case of motion event encoding

Linguistic relativity is the influence of language on other realms of cognition. For instance, the way movement is expressed in a person’s native language may influence how they perceive movement. Motion event encoding (MEE) is usually framed as a …

Dutch modality exclusivity norms for 336 properties and 411 concepts

Part of the toolkit of language researchers is formed of stimuli that have been rated on various dimensions. The current study presents modality exclusivity norms for 336 properties and 411 concepts in Dutch. Forty-two respondents rated the auditory, …

Web application: Dutch modality exclusivity norms

This app presents linguistic data over several tabs. The code combines the great front-end of Flexdashboard—based on R Markdown and yielding an unmatched user interface—, with the great back-end of Shiny—allowing users to download sections of data they select, in various formats. The hardest nuts to crack included modifying the rows/columns orientation without affecting the functionality of tables. A cool, recent finding was the reactable package. A nice feature, allowed by Flexdashboard, was the use of quite different formats in different tabs.

Language evolution: Current status and future directions

The topic of language evolution is characterised by the scarcity of records, but also by a large flow of research produced within multiple subtopics and perspectives. Over the past few decades, significant advancement has been made on the …