Please use the following text to cite this item or export to a predefined format:
Ainara Estarrona; Izaskun Etxeberria; Ricardo Etxepare; Ander Soraluze and Manuel Padilla-Moyano, 2026, BIM/SAHCOBA corpus: Syntactically Annotated Historical Corpus in Basque, Dspace HiTZ Zentroa, https://hdl.handle.net/20.500.14614/43.
dc.contributor.author: Ainara Estarrona
dc.contributor.author: Izaskun Etxeberria
dc.contributor.author: Ricardo Etxepare
dc.contributor.author: Ander Soraluze
dc.contributor.author: Manuel Padilla-Moyano
dc.date.accessioned: 2026-06-22T08:32:35Z
dc.date.available: 2026-06-22T08:32:35Z
dc.date.issued: 2026-06-17
dc.description: Basque in the Making (BIM): A Historical Look at a European Language Isolate and Syntactically Annotated Historical Corpus in Basque (SAHCOBA) are two projects for the construction of a morphosyntactically annotated historical corpus of Basque. This corpus will comprise both part-of-speech and syntactic annotation, along with a rich metadata structure. Our database will allow us to search the annotated corpus by words, lemmas, grammatical categories, sequences of grammatical categories, and specific structural configurations. The BIM project aims to collect the most significant works from the 15th century to the mid-18th century (Archaic and Old Basque), while the SAHCOBA project aims to extend this corpus from the mid-18th century to the mid-20th century (Early and Late Modern Basque), when standard Basque appeared. BIM and SAHCOBA are interdisciplinary projects that bring together experts in Linguistics and Natural Language Processing.
dc.identifier.uri: https://hdl.handle.net/20.500.14614/43
dc.language.iso: Basque
dc.publisher: HiTZ (University of the Basque Country)
dc.relation.isreferencedby: https://doi.org/10.1093/llc/fqab066
dc.rights: Creative Commons - Attribution 4.0 International (CC BY 4.0)
dc.rights.label: PUB
dc.rights.uri: http://creativecommons.org/licenses/by/4.0/
dc.source.uri: https://bim.ixa.eus/
dc.subject: Digital Humanities
dc.subject: historical corpus
dc.subject: basque
dc.subject: diachronic syntax
dc.title: BIM/SAHCOBA corpus: Syntactically Annotated Historical Corpus in Basque
dc.type: corpus
local.contact.person: Ainara Estarrona ainara.estarrona@ehu.eus HiTZ center (University of the Basque Country)
local.demo.uri: https://bim.ixa.eus/
local.files.count: 1
local.files.size: 2926904
local.has.files: yes
local.size.info: 600000 tokens
local.sponsor: nationalFunds RTI2018-098082-J-I00 Ministerio de Ciencia Innovación y Universidades (MICINN) SAHCOBA: Syntactically Annotated Historical Corpus in Basque (MICINN)
local.sponsor: nationalFunds ANR-17-CE27-0011 Agence Nationale de la Recherche (ANR) Basque in the Making: A Historical Look at a Language Isolate – BIM
metashare.ResourceInfo#ContentInfo.mediaType: text
This item is Publicly Available and licensed under: Creative Commons - Attribution 4.0 International (CC BY 4.0)
Files in this item
Name: BZENTROA-BIM.zip
Size: 2.79 MB
Format: application/zip
MD5: f7f1fb550c7c5a0d5548e114fecb2dad