Informatics and Automation
RUS  ENG    JOURNALS   PEOPLE   ORGANISATIONS   CONFERENCES   SEMINARS   VIDEO LIBRARY   PACKAGE AMSBIB  
General information
Latest issue
Archive

Search papers
Search references

RSS
Latest issue
Current issues
Archive issues
What is RSS



Informatics and Automation:
Year:
Volume:
Issue:
Page:
Find






Personal entry:
Login:
Password:
Save password
Enter
Forgotten password?
Register


Informatics and Automation, 2022, Issue 21, volume 5, Pages 916–936
DOI: https://doi.org/10.15622/ia.21.5.3
(Mi trspy1213)
 

This article is cited in 1 scientific paper (total in 1 paper)

Artificial Intelligence, Knowledge and Data Engineering

Opening the black box: Finding Osgood's semantic factors in word2vec space

I. Surov

ITMO University
Abstract: State-of-the-art models of artificial intelligence are developed in the black-box paradigm, in which sensitive information is limited to input-output interfaces, while internal representations are not interpretable. The resulting algorithms lack explainability and transparency, requested for responsible application. This paper addresses the problem by a method for finding Osgood’s dimensions of affective meaning in multidimensional space of a pre-trained word2vec model of natural language. Three affective dimensions are found based on eight semantic prototypes, composed of individual words. Evaluation axis is found in 300-dimensional word2vec space as a difference between positive and negative prototypes. Potency and activity axes are defined from six process-semantic prototypes (perception, analysis, planning, action, progress, and evaluation), representing phases of a generalized circular process in that plane. All dimensions are found in simple analytical form, not requiring additional training. Dimensions are nearly orthogonal, as expected for independent semantic factors. Osgood’s semantics of any word2vec object is then retrieved by a simple projection of the corresponding vector to the identified dimensions. The developed approach opens the possibility for interpreting the inside of black box-type algorithms in natural affective-semantic categories, and provides insights into foundational principles of distributive vector models of natural language. In the reverse direction, the established mapping opens machine-learning models as rich sources of data for cognitive-behavioral research and technology.
Keywords: semantics, dimension, Osgood, affective meaning, interpretation, word2vec, language, black box.
Funding agency Grant number
Russian Science Foundation 20-71-00136
This research is supported by RNF (grant № 20-71-00136).
Received: 18.07.2022
Document Type: Article
UDC: 004.8
Language: English
Citation: I. Surov, “Opening the black box: Finding Osgood's semantic factors in word2vec space”, Informatics and Automation, 21:5 (2022), 916–936
Citation in format AMSBIB
\Bibitem{Sur22}
\by I.~Surov
\paper Opening the black box: Finding Osgood's semantic factors in word2vec space
\jour Informatics and Automation
\yr 2022
\vol 21
\issue 5
\pages 916--936
\mathnet{http://mi.mathnet.ru/trspy1213}
\crossref{https://doi.org/10.15622/ia.21.5.3}
Linking options:
  • https://www.mathnet.ru/eng/trspy1213
  • https://www.mathnet.ru/eng/trspy/v21/i5/p916
  • This publication is cited in the following 1 articles:
    Citing articles in Google Scholar: Russian citations, English citations
    Related articles in Google Scholar: Russian articles, English articles
    Informatics and Automation
    Statistics & downloads:
    Abstract page:109
    Full-text PDF :42
     
      Contact us:
     Terms of Use  Registration to the website  Logotypes © Steklov Mathematical Institute RAS, 2024