The treebank data was derived by translating sentences from the following Turkish UD treebanks into respective languages and using silver annotation process:
Training data — translated from:
- UD_Turkish-IMST (
tr_imst-ud-train.conllu) - UD_Turkish-GB (
tr_gb-ud-test.conllu) - UD_Turkish-FrameNet (
tr_framenet-ud-train.conllu)
Test data — translated from:
- ud-turkic/parallel (
tr_tuecl-ud-test.fa.conllu)
- train: based on training data sources above (translated & silver annotated)
- test: taken from here
- train: based on training data sources above (translated & silver annotated)
- test: based on test data source above (translated & silver annotated)
- train: based on training data sources above (translated & silver annotated)
- test taken from here
- train: based on training data sources above (translated & silver annotated)
- test: based on test data source above (translated & silver annotated)
- Turkic UD Group: https://github.com/ud-turkic
- UD_Turkish-IMST: Umut Sulubacak, Gülşen Eryiğit. Implementing Universal Dependency, Morphology and Multiword Expression Annotation Standards for Turkish Language Processing. Turkish Journal of Electrical Engineering & Computer Sciences, DOI: 10.3906/elk-1706-81):1–23. May 2018
- UD_Turkish-GB: Çağrı Çöltekin (2015) A grammar-book treebank of Turkish In: Proceedings of the 14th workshop on Treebanks and Linguistic Theories (TLT 14)
- UD_Turkish-FrameNet: Marşan, B., Kara, N., Özçelik, M., Arıcan, B. N., Cesur, N., Kuzgun, A., ... & Yıldız, O. T. (2021, January). Building the Turkish FrameNet. In Proceedings of the 11th Global Wordnet Conference (pp. 118-125).
- UD-Turkic/Parallel: https://github.com/ud-turkic/parallel