Contribution: Zero marking and the order of core arguments

by Kaius Sinnemäki and Noora Ahola

The original version of this dataset was created in 2010 for the following article:

Sinnemäki, Kaius. 2010. Word order in zero-marking languages. Studies in Language 34(4). 869–912. (doi:10.1075/sl.34.4.04sin)

The original article contained data on 848 languages, but here we have added data on 47 more languages, also fixing some errors in the original data. This dataset thus contains data on 895 languages, from 469 genera. This repository has been created by Kaius Sinnemäki with the assistance of Noora Ahola. We are also grateful to Viljami Haakana for assistance with the bibliography.

The CrossGram dataset is based on the dataset released under https://version.helsinki.fi/hals/sinnemaki/sinnemaki2010, containing all the data and all metadata in a computer-readable form, roughly following the practices of the AUTOTYP database (Bickel et al. 2022: doi:10.5281/zenodo.5931509).

Name Glottocode Family Source Examples
Details L-Parameter Description Topics Representation
Details Name Title Author Year Languages