HelexKids: A word frequency database for Greek and Cypriot primary school children

Aris Terzopoulos, Lynne Duncan, Mark A.J. Wilson, Georgia Niolaki, Jackie Masterson

Research output: Contribution to journalArticlepeer-review

14 Citations (Scopus)
191 Downloads (Pure)

Abstract

In this article, we introduce HelexKids, an online written word database for Greek-speaking children in primary education (grades 1 to 6). The database is organised on a grade by grade basis and on a cumulative basis by combining grade 1 with grades 2 to 6. It provides values for: Zipf, frequency per million, dispersion, estimated word frequency per million, standard word frequency contextual diversity, orthographic Levenshtein distance and lemma frequencies. These values are derived from 116 textbooks used in primary education in Greece and Cyprus producing a total of 68,692 different word types. HelexKids has been developed to assist researchers in studying language development, educators in selecting age-appropriate items for teaching as well as writers and authors of educational books for Greek/Cypriot children. The database is open access and can be searched online at http://www.helexkids.org
Original languageEnglish
Pages (from-to)83-96
Number of pages14
JournalBehavior Research Methods
Volume49
Issue number1
Early online date28 Jan 2016
DOIs
Publication statusPublished - Feb 2017

Bibliographical note

This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Keywords

  • Word database
  • Greek language
  • Children
  • Frequency
  • Contextual diversity

Fingerprint

Dive into the research topics of 'HelexKids: A word frequency database for Greek and Cypriot primary school children'. Together they form a unique fingerprint.

Cite this