Peer reviewed
ERIC Number: EJ1284889
Record Type: Journal
Publication Date: 2021
Pages: 15
Abstractor: As Provided
ISBN: N/A
ISSN: EISSN-1305-578X
EISSN: N/A
Identifying Themes in Fiction: A Centroid-Based Lexical Clustering Approach
Omar, Abdulfattah
Journal of Language and Linguistic Studies, v17 spec iss 1 p580-594 2021
In recent years, numerous computational methods have been developed and widely adopted in the humanities and literary studies. Although such methods can offer workable solutions to problems inherent in research within these domains, including selectivity, objectivity, and replicability, very little empirical work has applied them to thematic studies in literature. Such studies are almost entirely undertaken through traditional methods, in which individual researchers read the texts and intuitively abstract generalizations from their reading. This has negative implications for objectivity and replicability. Furthermore, traditional methods struggle to deal effectively with the hundreds of thousands of new novels that are published every year. In the face of these problems, this study proposes an integrated computational model for the thematic classification of literary texts based on lexical clustering methods. The study is based on a corpus comprising Thomas Hardy's novels and short stories. It employs computational semantic analysis based on a vector space model (VSM) representation of the lexical content of the texts. The results indicate that the selected texts could be grouped thematically based on their semantic content. Thus, there is now evidence that text clustering approaches, which have long been used in computational theory and data mining applications, can be usefully applied in literary studies.
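The abstract names a vector space model and centroid-based lexical clustering but does not give the weighting scheme or the clustering algorithm. As a rough illustration only, the sketch below assumes TF-IDF weighting, cosine similarity, and a spherical k-means variant with farthest-first seeding; these choices, all function names, and the four toy sentences are assumptions made for this example and are not taken from the article or from Hardy's texts.

```python
import math
from collections import Counter


def tokenize(text):
    # Lowercase and keep purely alphabetic tokens (a deliberately crude tokenizer).
    return [w for w in text.lower().split() if w.isalpha()]


def normalize(vec):
    # Scale a vector to unit length so a dot product equals cosine similarity.
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm else vec


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def tfidf_vectors(docs):
    # Build one unit-length TF-IDF vector per document over a shared vocabulary.
    tokenized = [tokenize(d) for d in docs]
    vocab = sorted({w for toks in tokenized for w in toks})
    n = len(docs)
    df = Counter(w for toks in tokenized for w in set(toks))
    vectors = []
    for toks in tokenized:
        tf = Counter(toks)
        # Assumed weighting: tf * (log(n / df) + 1).
        vectors.append(normalize([tf[w] * (math.log(n / df[w]) + 1) for w in vocab]))
    return vectors


def kmeans(vectors, k, iterations=20):
    # Farthest-first seeding: start from document 0, then repeatedly add the
    # document least similar to the centroids chosen so far (deterministic).
    centroids = [vectors[0]]
    while len(centroids) < k:
        idx = min(range(len(vectors)),
                  key=lambda i: max(dot(vectors[i], c) for c in centroids))
        centroids.append(vectors[idx])
    labels = None
    for _ in range(iterations):
        # Assign each document to its most similar centroid.
        new_labels = [max(range(k), key=lambda j: dot(v, centroids[j]))
                      for v in vectors]
        if new_labels == labels:
            break  # assignments are stable
        labels = new_labels
        # Recompute each centroid as the normalized mean of its members.
        for j in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == j]
            if members:
                mean = [sum(col) / len(members) for col in zip(*members)]
                centroids[j] = normalize(mean)
    return labels


# Invented toy sentences standing in for texts; NOT drawn from the study's corpus.
docs = [
    "the farmer ploughed the field at harvest",
    "harvest came and the farmer reaped the field",
    "the lovers married after a secret courtship",
    "a secret courtship ended when the lovers married",
]
labels = kmeans(tfidf_vectors(docs), k=2)
print(labels)
```

On these toy inputs the two farming sentences share one cluster label and the two courtship sentences share the other. A real corpus would need proper tokenization, stop-word handling, and a principled choice of the number of clusters, none of which the abstract specifies.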
Journal of Language and Linguistic Studies. Hacettepe Universitesi, Egitim Fakultesi B Blok, Yabanci Diller Egitimi Bolumu, Ingiliz Dili Egitimi Anabilim Dali, Ankara 06800, Turkey. e-mail: jllsturkey@gmail.com; Web site: http://www.jlls.org
Publication Type: Journal Articles; Reports - Research
Education Level: N/A
Audience: N/A
Language: English
Sponsor: N/A
Authoring Institution: N/A
Grant or Contract Numbers: N/A