Wals Roberta Sets 37-70.zip [WORKING]
: Definite (37A) and Indefinite (38A) article systems.
: Obligatory possessive inflection (58A) and possessive classification (59A).
: Ordinal (53A) and distributive (54A) numerals, and numeral classifiers (55A). Nominal Syntax (Chapters 58–64) : WALS roberta sets 37-70.zip
: Perfective/imperfective aspect (65A), past tense (66A), future tense (67A), and the perfect (68A).
For more information on the specific data points, you can explore the Official WALS Features List or the WALS-Bench dataset on Hugging Face. : Definite (37A) and Indefinite (38A) article systems
The "RoBERTa" designation suggests this data has been pre-processed or formatted for use with the (Robustly Optimized BERT Pretraining Approach) large language model, likely for tasks like cross-lingual transfer or testing a model's metalinguistic knowledge. Included Linguistic Features (Chapters 37–70)
: Position of tense-aspect affixes (69A) and the morphological imperative (70A). Use Cases for the Dataset coding of nominal plurality (33A)
: Gender assignment (32A), coding of nominal plurality (33A), and the number of cases (49A).