Lexical patterns in Indonesian academic writing: a corpus-based analysis for writing assistance
Keywords:
academic writing, corpus linguistics, Indonesian language, lexical bundles, register variationAbstract
Background: Despite growing demands for academic publication in Indonesian higher education, many students and early-career researchers struggle with appropriate lexical usage. Objective: This study identifies lexical features that characterize proficient Indonesian academic writing through a corpus-based analysis. Method: A large corpus of journal articles and theses was analyzed using frequency, lexical bundle, and keyword analysis across sections and registers. Results: Academic texts are dominated by abstract nouns and procedural verbs, while lexical bundles show strong sectional specialization; keyword analysis highlights contrasts with non-academic texts in abstraction, impersonality, and epistemic stance. Implication: These findings support the development of corpus-informed writing instruction and tools to enhance academic literacy in Indonesian contexts. Novelty: This study integrates section-sensitive lexical analysis with register-based comparison to provide a comprehensive empirical model of academic lexical proficiency.
References
[1] A. Fadhil, W. Gunawan, and Y. Wirza, “Lexical Density in EFL Indonesian Textbooks: A Comparative Analysis,” JALL (Journal of Applied Linguistics and Literacy), Feb. 2023, doi: 10.25157/jall.v7i1.9727.
[2] E. Sujatna, H. Heriyanto, and S. Andri, “Lexical density and variation in Indonesian folklores in English student textbooks: an SFL study,” Leksika: Jurnal Bahasa, Sastra dan Pengajarannya, Sep. 2021, doi: 10.30595/lks.v15i2.11102.
[3] A. D. G. Abi, “Lexical Richness in Essays: A Corpus Based Study among Indonesian Junior High School Students,” JOURNAL OF ENGLISH AND EDUCATION, Jun. 2024, doi: 10.31327/jee.v9i1.2166.
[4] A. Fawaid and H. S. Malika, “The Role of Formulaic Expressions through Lesson Study in Improving Writing Skills of Islamic Elementary Students in Indonesian Language Subject,” Al-Madrasah Jurnal Pendidikan Madrasah Ibtidaiyah, vol. 9, no. 1, p. 227, Jan. 2025, doi: 10.35931/am.v9i1.4115.
[5] A. Almosa, “Formulaic Sequences Used in Academic Writing Register,” Journal of Higher Education Theory and Practice, May 2024, doi: 10.33423/jhetp.v24i4.6948.
[6] I. N. Oktavianti and I. Prayogi, “Discourse functions of lexical bundles in Indonesian EFL learners’ argumentative essays: A corpus study,” Studies in English Language and Education, May 2022, doi: 10.24815/siele.v9i2.23995.
[7] P. Ardi, Y. D. Oktafiani, N. Widianingtyas, O. Dekhnich, and U. Widiati, “Lexical Bundles in Indonesian EFL Textbooks: A Corpus Analysis,” Journal of Language and Education, Jun. 2023, doi: 10.17323/jle.2023.16305.
[8] E. S. Lestari, I. N. Oktavianti, and R. A. Aziz, “Functional Categories of Lexical Bundles in Indonesian EFL Textbooks: A Corpus-Based Study,” Indonesian Journal of EFL and Linguistics, May 2025, doi: 10.21462/ijefl.v10i1.907.
[9] S. Yuliawati, D. Ekawati, and R. E. Mawarrani, “INVESTIGATING LEXICAL BUNDLES IN THE CORPORA OF ENGLISH AND INDONESIAN RESEARCH ARTICLES WITH THE SKETCH ENGINE,” Jurnal Sosioteknologi, Aug. 2021, doi: 10.5614/sostek.itbj.2021.20.2.5.
[10] E. Kurniawan and C. Permatasari, “Lexical bundles in accepted and rejected Scopus-indexed hard science research article introductions,” Studies in English Language and Education, May 2025, doi: 10.24815/siele.v12i2.38505.
[11] A. Budiwiyanto, “use of lexical bundles in an online comprehensive dictionary of Indonesian (KBBI Daring),” Lexicography, Jun. 2023, doi: 10.1558/lexi.25177.
[12] F. Etfita, S. Wahyuni, A. Ahmad, W. Sudusinghe, and C. K. W. Gamage, “Revealing lexical bundles of non-native English essay at a private Islamic university: A corpus-based evaluation,” English Learning Innovation, Aug. 2025, doi: 10.22219/englie.v6i2.40821.
[13] S. Lo, “Neural machine translation in EFL classrooms: learners’ vocabulary improvement, immediate vocabulary retention and delayed vocabulary retention,” Computer Assisted Language Learning, vol. 38, no. 3, pp. 592–611, 2025, doi: 10.1080/09588221.2023.2207603.
[14] A. Fawaid, P. Handayani, and Y. A. Abdillah, “E-Portofolio in Improving Critical Thinking and Self-Management through Lesson Study: A Study on Writing Pedagogy in Higher Education,” presented at the 2024 10th International Conference on Education and Technology (ICET), IEEE, 2024, pp. 149–154. doi: 10.1109/ICET64717.2024.10778453.
[15] N. Afifi, “Exploring the use of grammatical metaphor in Indonesian EFL learners’ academic writing,” Indonesian Journal of Applied Linguistics, Jan. 2021, doi: 10.17509/ijal.v10i3.31759.
[16] N. F. B. M. Ismail, A. S. B. Kamarulzaman, N. Z. B. Zainol, N. H. B. Nordin, and K. A. B. M. Jamil, “Lexical Bundle Patterns in Malaysian Educational Texts: A Corpus-Based Analysis of English Language Textbooks,” Malaysian Journal of Social Sciences and Humanities (MJSSH), Sep. 2024, doi: 10.47405/mjssh.v9i9.2986.
[17] A. T. Birhan, “Effects of Teaching Lexical Bundles on EFL Studentsâ€TM Abstract Genre Academic Writing Skills Improvement: Corpus-Based Research Design,” International Journal of Language Education, Mar. 2021, doi: 10.26858/ijole.v5i1.14917.
[18] A. B. Olani, A. Olani, T. B. Muleta, D. H. Rikitu, and K. G. Disassa, “Impacts of language barriers on healthcare access and quality among Afaan Oromoo-speaking patients in Addis Ababa, Ethiopia,” BMC Health Services Research, vol. 23, Jan. 2023, doi: 10.1186/s12913-023-09036-z.
[19] Y. Li and H. Lei, “Lexical Bundles in L1 and L2 English Academic Writing: Convergent and Divergent Usage,” SAGE Open, vol. 15, Apr. 2025, doi: 10.1177/21582440251333850.
[20] B. W. Y. Siu, M. Afzaal, H. S. Aldayel, and S. Curle, “Unlocking the Mysteries of Academic Writing: A Corpus-based Analysis of Lexical Bundles in L2 English for Engineering Students,” SAGE Open, vol. 14, Oct. 2024, doi: 10.1177/21582440241299997.
[21] A. D. Cahyanti and Y. Y. Dharmawan, “Reframing EFL Classrooms Students’ Perspectives on Translanguaging as a Pedagogical Strategy in Indonesian Senior High School,” Indonesian Journal of Teaching and Learning (INTEL), Aug. 2025, doi: 10.56855/intel.v4i3.1673.
[22] E. H. Hiebert, “Flattening the Developmental Staircase: Lexical Complexity Progression in Elementary Reading Texts Across Six Decades,” Education Sciences, vol. 15, no. 11, 2025, doi: 10.3390/educsci15111546.
[23] A. Alasmary, “Lexical bundles in psychology lectures and textbooks: a contrastive corpus-based study with implications for academic writing,” Frontiers in Psychology, vol. 16, Apr. 2025, doi: 10.3389/fpsyg.2025.1545355.
[24] A. Alhazmi, R. Mahmud, N. Idris, M. E. M. Abo, and C. Eke, “Code-mixing unveiled: Enhancing the hate speech detection in Arabic dialect tweets using machine learning models,” PLOS ONE, vol. 19, Jul. 2024, doi: 10.1371/journal.pone.0305657.
[25] Juanda and I. Afandi, “Assessing text comprehension proficiency: Indonesian higher education students vs ChatGPT,” XLinguae, Jan. 2024, doi: 10.18355/xl.2024.17.01.04.
[26] A. Fawaid, R. Assyabani, I. Abdullah, C. Muali, M. S. Itqan, and S. Islam, “Human Intelligence and Algorithmic Precision: An Experimental Study of Indonesian Translation Pedagogy in Higher Education,” Asian Journal of University Education, vol. 21, no. 3, pp. 779–792, 2025, doi: 10.24191/ajue.v21i3.53.
[27] M. Bashori, “The development of intra-individual variability in academic writing: A study on lexical diversity and lexical sophistication,” SIELE: Studies in English Language and Education, vol. 8, no. 2, pp. 745–758, May 2021, doi: 10.24815/siele.v8i2.16843.
Downloads
Published
Issue
Section
License
Copyright (c) 2026 Abdul Wahid (Author)

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.










Creative Commons Attribution 4.0 International License