Identification of authorship based on vectorization of lyrics by Anna Akhmatova and Marina Tsvetaeva with psychophysiological methods

Authors

DOI:

https://doi.org/10.33910/2687-0223-2023-5-1-4-13

Keywords:

2B-PLS, poems by A. Akhmatova, poems by M. Tsvetaeva, differentiation of authors, vectorization of texts

Abstract

Purposeful dialog systems can employ an authorship determinant in order to obtain a more accurate answer which addresses a specific user and takes into account his features. This article discusses the possibility of distinguishing the works of two authors based on the results of vectorization of their texts using methods that are successfully used in interdisciplinary research.

2B-PLS (Two-Block Projection to Latent Structure) has demonstrated high efficiency in analyzing the results of interdisciplinary research in neurolinguistics, psychophysiology and other fields.

The study described in this article involved the lyrics by Anna Akhmatova and Marina Tsvetaeva, whose works are nowadays researched by many scholars. The author selected 310 poetic texts: 196 poems by Akhmatova, and 114, by Tsvetaeva. The parameters for the analysis were the results of text vectorization: proportions of verbs in the text, proper names, adjectives, adverbs, unique words in the text, punctuation marks, functional parts of speech and significant parts of speech, average line length, number of lines, variety of punctuation marks.

The 2B-PLS analysis based on the vectorization results for the texts in question showed clear differences between the works of the two poets. The author discussed these findings with scholars of Akhmatova and Tsvetaeva.

The lyrics of Akhmatova (as compared to those of Tsvetaeva) are characterized by more frequent use of verbs, adjectives, adverbs, service parts of speech, significant parts of speech, as well as more lines and more variety of punctuation marks.

The lyrics of Tsvetaeva (as compared to those of Akhmatova) are characterized by longer lines and a more diverse vocabulary, as well as more frequent use of punctuation marks, nouns and proper names.

The results obtained correlate with theoretical studies.

References

ЛИТЕРАТУРА

Аванесян, Н. Л., Соловьев, Ф. Н., Чеповский, А. А. (2021) Характеристики текстов сообществ социальных сетей. Вестник Новосибирского государственного университета. Серия: Информационные технологии, т. 19, № 1, с. 5–14.

Ашимова, А. Ф., Юсуфов, М. Г., Юсуфова, Л. О. (2017) Расчлененные синтаксические структуры со вставной конструкцией в художественной прозе М. Цветаевой. Вестник Социально-педагогического института, № 3 (23), с. 30–36.

Гречачин, В. А. (2018) Статистические методы в исследовании текстов. Вестник Башкирского университета, т. 23, № 3, с. 772–776.

Ковалева, В. Ю., Поздняков, А. А., Литвинов, Ю. Н., Ефимов, В. М. (2019) Оценка сопряженности морфогенетических и молекулярно-генетических модулей изменчивости серых полевок Microtus S.L. в градиентных условиях среды. Экологическая генетика, т. 17, № 2, с. 21–34. https://doi.org/10.17816/ecogen17221-34

Ковынева, И. А., Скляр, Е. С. (2019) Функционально-семантические особенности окказионализмов в поэзии А. А. Фета, Ф. И. Тютчева и М. И. Цветаевой. Балтийский гуманитарный журнал, т. 8, № 1 (26), с. 86–88.

Козловская, А. А. (2021) Личные формы предикатива с семантикой позитивной экспрессии в лирике А. Ахматовой. Верхневолжский филологический вестник, № 1 (24), с. 104–111. https://doi.org/10.20323/2499-9679-2021-1-24-104-111

Мамаджанова, Х. К. (2020) Обстоятельный анализ синонимов в поэзии Марины Цветаевой, как демонстрирование номинативного варьирования. Вестник Педагогического университета, № 4 (87), с. 46–50.

Марзаганова, Л. М. (2020) Особенности лирики Марины Цветаевой. Молодой ученый, № 3 (293), с. 134–136.

Могушкова, Т. С. (2021) Сборник стихов «Вечерний альбом» Марины Ивановны Цветаевой. Вестник науки, т. 3, № 4 (37), с. 26–30.

Пимешков, В. К., Диковицкий, В. В., Шишаев, М. Г. (2020) Извлечение отношений тезауруса из текстов на естественном языке с использованием статистических и лингвистических методов. Труды Кольского научного центра РАН. Информационные технологии, № 8-11, с. 188–192. https://doi.org/10.37614/2307-5252.2020.8.11.028

Ревзина, О. Г. (2009) Безмерная Цветаева: опыт системного описания поэтического идиолекта. М.: Дом- музей Марины Цветаевой, 594 с.

Рижинашвили, А. Л. (2018) Методы статистического анализа текстов научных публикаций в работе историка науки. Проблемы деятельности ученого и научных коллективов, № 4 (34), с. 76–86.

Ризванова, Н. А., Кадырова, К. А. (2020) Композиционная роль заглавия, посвящения и подзаголовка в поэме Марины Цветаевой «Крысолов». Мир науки, культуры, образования, № 3 (82), с. 369–371. https://doi.org/10.24411/1991-5497-2020-00579

Русская литература 1920–1930 гг. (2015) StudFiles. [Электронный ресурс]. URL: https://studfile.net/preview/2681017/ (дата обращения 19.11.2022).

Самохина, Е. Н., Бобков, Д. В. (2021) Особенности перевода стихотворений на язык хинди на примере творчества Марины Цветаевой и Анны Ахматовой. Казанский вестник молодых ученых, т. 5, № 5, с. 47–52.

Твердохлеб, О. Г. (2017) Предикативное употребление компаратива в поэзии А. А. Ахматовой. Вестник Пермского университета. Российская и зарубежная филология, т. 9, № 1, с. 77–86. https://doi.org/10.17072/2037-6681-2017-1-77-86

Ульченко, А. Н. (2021) Автоматическая замена литературных слов на разговорные аналоги. В кн.: World Science: Problems and Innovations: сборник статей LIV Международной научно-практической конференции. Пенза: Наука и Просвещение, с. 62–65.

Хадзиева, А. А., Нальгиева, А. А. (2018) Особенности творчества Анны Ахматовой. Проблемы современной науки и образования, № 2 (122), с. 35–37.

Цветкова, М. В. (2017) Поэт, поэзия и творческий дар в художественном мире Марины Цветаевой. Ученые записки Казанского университета. Серия: Гуманитарные науки, т. 159, № 1, с. 138–153.

Чжан, В. (2022) Мужество как философия бытия (на материале творчества А. Ахматовой и О. Берггольц периода Великой Отечественной войны). Филологические науки. Вопросы теории и практики, т. 15, № 2, с. 298–303. https://doi.org/10.30853/phil20210681

Krivoshchekov, S. G., Nikolaeva, E. I., Vergunov, E. G., Prihodko, A. Yu. (2022) Multivariate analysis of indicators of inhibitory and autonomic control in orthostasis and emotional situations. Human Physiology, vol. 48, no. 1, pp. 20–29. https://doi.org/10.1134/S0362119721060050

Nikolaeva, E. I., Efimova, V. L., Vergunov, E. G. (2022) Integration of vestibular and auditory information in ontogenesis. Children, vol. 9, no. 3, article 401. https://doi.org/10.3390/children9030401

Polunin, D., Shtaiger, I., Efimov, V. (2019) JACOBI4 software for multivariate analysis of biological data. bioRxiv. [Online]. Available at: https://doi.org/10.1101/803684 (accessed 19.11.2022).

Rännar, S., Lindgren, F., Geladi, P., Wold, S. (1994) A PLS kernel algorithm for data sets with many variables and fewer objects. Part 1: Theory and algorithm. Journal of Chemometrics, vol. 8, no. 2, pp. 111–125. https://doi.org/10.1002/cem.1180080204

Rohlf, F. J., Corti, M. (2000) Use of two-block partial least-squares to study covariation in shape. Systematic Biology, vol. 49, no. 4, pp. 740–753. https://doi.org/10.1080/106351500750049806

Savostyanov, A. N., Vergunov, E. G., Saprygin, A. E., Lebedkin, D. A. (2022) Validation of a face image assessment technology to study the dynamics of human functional states in the EEG resting-state paradigm. Vavilov Journal of Genetics and Breeding, vol. 26, no. 8, pp. 765–772. https://doi.org/10.18699/VJGB-22-92

Vergunov, E. G. (2022) Coping space transformation at different levels of university training during the pandemic and the assessment of its integral indicators. Kompleksnye issledovaniya detstva — Comprehensive Child Studies, vol. 4, № 2, pp. 115–123. https://doi.org/10.33910/2687-0223-2022-4-2-115-123

REFERENCES

Avanesyan, N. L., Solovev, F. N., Chepovskij, A. A. (2021) Kharakteristiki tekstov soobshchestv sotsial’nykh setej [Characteristics of texts of social networks communities]. Vestnik Novosibirskogo gosudarstvennogo universiteta. Seriya: Informacionnye tekhnologii — Vestnik NSU. Series: Information Technologies, vol. 19, no. 1, pp. 5–14. (In Russian)

Ashimova, A. F., Yusufov, M. G., Yusufova, L. O. (2017) Raschlenennye sintaksicheskie struktury so vstavnoj konstruktsiej v khudozhestvennoj proze M. Tsvetaevoj [Dissected syntactis structures with an inserted construction in M. Tsvetaeva’s artistic prose]. Vestnik Sotsial’no-pedagogicheskogo instituta — Bulletin of the Socio-Pedagogical Institute, no. 3 (23), pp. 30–36. (In Russian)

Grechachin, V. A. (2018) Statisticheskie metody v issledovanii tekstov [Statistical methods in the study of texts]. Vestnik Bashkirskogo universiteta — Bulletin of Bashkir University, vol. 23, no. 3, pp. 772–776. (In Russian)

Khadzieva, A. A., Nalgieva, A. A. (2018) Osobennosti tvorchestva Anny Akhmatovoj [Features of Anna Akhmatova’s creativity]. Problemy sovremennoj nauki i obrazovaniya — Modern Problems of Science and Education, no. 2 (122), pp. 35–37. (In Russian)

Kovaleva, V. Yu., Pozdnyakov, A. A., Litvinov, Yu. N., Efimov, V. M. (2019) Otsenka sopryazhennosti morfogeneticheskikh i molekulyarno-geneticheskikh modulej izmenchivosti serykh polevok Microtus S. L. v gradientnykh usloviyakh sredy [Estimation of the congruence between morphogenetic and molecular-genetic modules of gray voles Microtus S.L. variability along a climatic gradient]. Ekologicheskaya genetika — Ecological Genetics, vol. 17, no. 2, pp. 21–34. https://doi.org/10.17816/ecogen17221-34 (In Russian)

Kovyneva, I. A., Sklyar, E. S. (2019) Funktsional’no-semanticheskie osobennosti okkazionalizmov v poezii A. A. Feta, F. I. Tyutcheva i M. I. Tsvetaevoj [Functional semantic particularities of occasional words in Afanasy Fet’s, Fyodor Tyutchev’s and Marina Tsvetaeva’s poetry]. Baltijskij gumanitarnyj zhurnal — Baltic Humanitarian Journal, vol. 8, no. 1 (26), pp. 86–88. (In Russian)

Kozlovskaya, A. A. (2021) Lichnye formy predikativa s semantikoj pozitivnoj ekspressii v lirike A. Akhmatovoj [Personal predicatives with the semantics of positive expression in A. Akhmatova’s poetry]. Verkhnevolzhskij filologicheskij vestnik — Verhnevolzhski Philological Bulletin, no. 1 (24), pp. 104–111. https://doi.org/10.20323/2499-9679-2021-1-24-104-111 (In Russian)

Krivoshchekov, S. G., Nikolaeva, E. I., Vergunov, E. G., Prihodko, A. Yu. (2022) Multivariate analysis of indicators of inhibitory and autonomic control in orthostasis and emotional situations. Human Physiology, vol. 48, no. 1, pp. 20–29. https://doi.org/10.1134/S0362119721060050 (In English)

Mamadzhanova, Kh. K. (2020) Obstoyatel’nyj analiz sinonimov v poezii Mariny Tsvetaevoj, kak demonstrirovanie nominativnogo var’irovaniya [Detailed analysis of synonymy in Marina Svetaeva poetry, as a manifestation of nominative variation]. Vestnik Pedagogicheskogo universiteta — Herald of the Pedagogical University, no. 4 (87), pp. 46–50. (In Russian)

Marzaganova, L. M. (2020) Osobennosti liriki Mariny Tsvetaevoj [Features of the lyrics of Marina Tsvetaeva]. Molodoj uchenyj, no. 3 (293), pp. 134–136. (In Russian)

Mogushkova, T. S. (2021) Sbornik stikhov “Vechernij al’bom” Mariny Ivanovny Tsvetaevoj [The collection of poems “Evening album” of Marina Tsvetaeva]. Vestnik nauki, vol. 3, no. 4 (37), pp. 26–30. (In Russian)

Nikolaeva, E. I., Efimova, V. L., Vergunov, E. G. (2022) Integration of vestibular and auditory information in ontogenesis. Children, vol. 9, no. 3, article 401. https://doi.org/10.3390/children9030401 (In English)

Pimeshkov, V. K., Dikovitskij, V. V., Shishaev, M. G. (2020) Izvlechenie otnoshenij tezaurusa iz tekstov na estestvennom yazyke s ispol’zovaniem statisticheskikh i lingvisticheskikh metodov [Extraction of relation from natural language texts using statistical and linguistic methods]. Trudy Kol’skogo nauchnogo tsentra RAN. Informatsionnye tekhnologii — Transactions of the Kola Scientific Center. Information Technologies, no. 8-11, pp. 188–192. https://doi.org/10.37614/2307-5252.2020.8.11.028 (In Russian)

Polunin, D., Shtaiger, I., Efimov, V. (2019) JACOBI4 software for multivariate analysis of biological data. bioRxiv. [Online]. Available at: https://doi.org/10.1101/803684 (accessed 19.11.2022). (In English)

Rännar, S., Lindgren, F., Geladi, P., Wold, S. (1994) A PLS kernel algorithm for data sets with many variables and fewer objects. Part 1: Theory and algorithm. Journal of Chemometrics, vol. 8, no. 2, pp. 111–125. https://doi.org/10.1002/cem.1180080204 (In English)

Revzina, O. G. (2009) Bezmernaya Tsvetaeva: opyt sistemnogo opisaniya poeticheskogo idiolekta [Immeasurable Tsvetaeva: The experience of a systematic description of a poetic idiolect]. Moscow: Marina Tsvetaeva’s House Museum Publ., 594 p. (In Russian)

Rizhinashvili, A. L. (2018) Metody statisticheskogo analiza tekstov nauchnykh publikatsij v rabote istorika nauki [The methods of statistical analysis of texts of scientific publications in a work of historian of science]. Problemy deyatel’nosti uchenogo i nauchnykh kollektivov — The Problems of Scientist and Scientific Groups Activity, no. 4 (34), pp. 76–86. (In Russian)

Rizvanova, N. A., Kadyrova, K. A. (2020) Kompozitsionnaya rol’ zaglaviya, posvyashcheniya i podzagolovka v poeme Mariny Tsvetaevoj “Krysolov” [The compositional role of the title, dedication, subtitle in Marina Tsvetaeva’s poem “The Rat-Man”]. Mir nauki, kul’tury, obrazovaniya — The World of Science, Culture and Education, no. 3 (82), pp. 369–371. https://doi.org/10.24411/1991-5497-2020-00579 (In Russian)

Rohlf, F. J., Corti, M. (2000) Use of two-block partial least-squares to study covariation in shape. Systematic Biology, vol. 49, no. 4, pp. 740–753. https://doi.org/10.1080/106351500750049806 (In English)

Russkaya literatura 1920–1930 gg. [Russian Literature 1920–1930]. (2015) StudFiles. [Online]. Available at: https://studfile.net/preview/2681017/ (accessed 19.11.2022). (In Russian)

Samokhina, E. N., Bobkov, D. V. (2021) Osobennosti perevoda stikhotvorenij na yazyk khindi na primere tvorchestva Mariny Tsvetaevoj i Anny Akhmatovoj [Features of translation of poems into hindi language on the example of the works of Marina Tsvetaeva and Anna Akhmatova]. Kazanskij vestnik molodykh uchenykh — Kazan Bulletin of Young Scientists, vol. 5, no. 5, pp. 47–52. (In Russian)

Savostyanov, A. N., Vergunov, E. G., Saprygin, A. E., Lebedkin, D. A. (2022) Validation of a face image assessment technology to study the dynamics of human functional states in the EEG resting-state paradigm. Vavilov Journal of Genetics and Breeding, vol. 26, no. 8, pp. 765–772. https://doi.org/10.18699/VJGB-22-92 (In English)

Tsvetkova, M. V. (2017) Poet, poeziya i tvorcheskij dar v khudozhestvennom mire Mariny Tsvetaevoj [Poet, poetry, and gift of creativity in Marina Tsvetaeva’s poetic world]. Uchenye zapiski Kazanskogo universiteta. Seriya: Gumanitarnye nauki — Proceedings of Kazan University. Humanities Series, vol. 159, no. 1, pp. 138–153. (In Russian)

Tverdokhleb, O. G. (2017) Predikativnoe upotreblenie komparativa v poezii A. A. Akhmatovoj [Predicative use of the comparative in the poetry of A. A. Akhmatova]. Vestnik Permskogo universiteta. Rossijskaya i zarubezhnaya filologiya — Perm University Herald. Russian and Foreign Philology, vol. 9, no. 1, pp. 77–86. https://doi. org/10.17072/2037-6681-2017-1-77-86 (In Russian)

Ulchenko, A. N. (2021) Avtomaticheskaya zamena literaturnykh slov na razgovornye analogi [Automatic replacement of literary words with colloquial counterparts]. In: World Science: Problems and Innovations: sbornik statej LIV Mezhdunarodnoj nauchno-prakticheskoj konferentsii [World Science: Problems and Innovations: Proceedings of the LIV International Scientific-Practical Conference]. Penza: Nauka i Prosveshchenie Publ., pp. 62–65. (In Russian)

Vergunov, E. G. (2022) Coping space transformation at different levels of university training during the pandemic and the assessment of its integral indicators. Kompleksnye issledovaniya detstva — Comprehensive Child Studies, vol. 4, no. 2, pp. 115–123. https://doi.org/10.33910/2687-0223-2022-4-2-115-123 (In English)

Zhang, W. (2022) Muzhestvo kak filosofiya bytiya (na materiale tvorchestva A. Akhmatovoj i O. Berggol’ts perioda Velikoj Otechestvennoj vojny) [Courage as philosophy of being (based on the creative work of A. Akhmatova and O. Bergholz during the Great Patriotic War)]. Filologicheskie nauki. Voprosy teorii i praktiki — Philology. Theory & Practice, vol. 15, no. 2, pp. 298–303. https://doi.org/10.30853/phil20210681 (In Russian)

Published

2023-04-03

Issue

Section

Articles