How to Cite

Classification of Genres through 500 Years of Spanish Literature in CORDE , in Hesselbach, Robert et al. (Eds.): Digital Stylistics in Romance Studies and Beyond, Heidelberg: Heidelberg University Publishing, 2024, p. 15–36. https://doi.org/10.17885/heiup.1157.c19362

License (Chapter)

Creative Commons License

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

Identifiers (Book)

ISBN 978-3-96822-200-4 (PDF)
ISBN 978-3-96822-201-1 (Hardcover)

Published

08/07/2024

Authors

José Calvo Tello

Classification of Genres through 500 Years of Spanish Literature in CORDE

Abstract In this work I analyze the development of numerous genres in almost five hundred years of Spanish literature. For that, I use a large diachronic cor­pus composed by the Real Academia Española. After an introductory section, the dataset is described, focusing on some important aspects of the distribution and balance of the categories. Next, several classification tests are applied in order to find out which parameters lead to the highest scores, and what the results for each category are. The variance of these results is then explored using linear regression, resulting in the length of the text being a good predictor for the classification results. Finally, the historical evolution is analyzed, showing that the classification results for genres neither get better nor worse over time, but remain stable.

Keywords: genre, classification, Spanish literature, corpora