Semantic vector evaluation and human performance on a new vocabulary MCQ test

Joseph Patrick Levy, John Bullinaria, Samantha McCormick

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

201 Downloads (Pure)

Abstract

Vectors derived from patterns of co-occurrence of words in large bodies of text have often been used as representations of some aspects of the meanings of different words. Generally, the distance between such vectors is used as a measure of the semantic similarity between the word meanings they represent. One important way of evaluating the performance of these vectors has been to use them to answer vocabulary multiple choice questions (MCQs) where the participant is asked to judge which of several choice words is closest in meaning to a stem word. The existing vocabulary MCQ tests used in this way have been very useful but there are some practical problems in their use as general evaluation measures. Here, we discuss why such tests remain useful evaluation measures, introduce a new vocabulary test, evaluate several current sets of semantic vectors using the new test and compare their performance to human data.
Original languageEnglish
Title of host publicationProceedings of the Annual Conference of the Cognitive Science Society
Subtitle of host publicationCogSci 2017 London: “Computational Foundations of Cognition”
PublisherCognitive Science Society
Number of pages6
Publication statusAccepted/In press - 11 Apr 2017

Keywords

  • Distributional semantics; vocabulary MCQ.

Cite this