2018

2018-01-11

Adjusting Word Embeddings with Semantic Intensity Orders

clustering words for intensity ordering首先需要从Google N-gram中抽取形容词的程度顺序信息。使用的方法是基于模版匹配的方法。比如从good but not great我们可以总结出一个规则xx but not xx，然后可以发现后者比前者的程度深。然后使用mixed integer linear programming (MILP)来进行最优化排序

Papers

2018-01-10

Integrating Distributional Lexical Contrast into Word Embeddings for Antonym-Synonym Distinction

他们想增强一些能够明显区分同义词和反义词的特征。

Papers

2018-01-09

Ontologically Grounded Multi-sense Representation Learning for Semantic Vector Space Models

IntroductionIn this paper thye propose two novel and general approaches for generating sense-specific word embeddings that are grounded in an ontology. 但是通用的词向量只为一个词分配一个词向量，无法解决解决一词多义的问题。虽然Yu and Dre

Papers

2018-01-07

AutoExtend Extending Word Embeddings to Embeddings for Synsets and Lexemes

Introduction这篇文章提出了一个叫做AutoExtand的方法，来运用其他的信息来增强word embedding的性能。用于学习用来表示synsets和lexemes。用于一些公开的知识库上，比如WordNet, Wikipedia和Freebase。 synset是指一个词语集合，这里面的词可以在一定的条件下相互替换。lexeme会将一个特定的拼写和发音和一个特定的意思匹配在一起。也

2017

Papers

2017-12-24

Semantic Specialisation of Distributional Word Vector Spaces using Monolingual and Cross-Lingual Constraints

这篇文章想用同一语言和跨语言资源中获得的约束来增强词向量的质量。

Papers

2017-10-31

Mimicking Word Embeddings using Subword RNNs

这篇文章挺有意思的，在已有的word embeddings上学习一个从字符级别的序列上建立一个word embedding。模型使用的是RNN，双向的，输入是一个单词，输出就是一个向量。训练的时候输入都是已有word embedding中lexicon的单词，输入的ground truth是原始的向量。这么做的目的是希望能够解决UNK（未登录词）的表示，它的理论假设是从字母组成语义是要遵循一系列

Papers

2017-10-30

Dynamic Routing Between Capsules

IntroductionThis paper is going to show that a discrimininatively trained, multi-layer capsule system achieves state-of-the-art performance on MNIST and isconsiderably better than a convolutional net

Papers

2017-09-06

Information-Theory Interpretation of the Skip-Gram Negative-Sampling Objective Function

A dependency measure based on Jensen-ShannonThey define a dependency measure between two random variables, which is based on the Jensen-Shannon divergence. The Kullback-Leibler (KL) divergence of a d

Papers

2017-09-05

Improved Word Representation Learning with Sememes

MethologyHowNetHowNet annotates precise senses to each word, and for each sense, HowNet annotates the significance of parts and attributes represented by sememes.

Papers

2017-08-27

Not All Contexts Are Created Equal:Better Word Representations with Variable Attention

AbstractThey introduce an extension to the bag-of-words model for learning words represen- tations that take into account both syntactic and semantic properties within language. IntroductionFor BOW an

Blog

分类：: Papers

Adjusting Word Embeddings with Semantic Intensity Orders

Integrating Distributional Lexical Contrast into Word Embeddings for Antonym-Synonym Distinction

Ontologically Grounded Multi-sense Representation Learning for Semantic Vector Space Models

AutoExtend Extending Word Embeddings to Embeddings for Synsets and Lexemes

Semantic Specialisation of Distributional Word Vector Spaces using Monolingual and Cross-Lingual Constraints

Mimicking Word Embeddings using Subword RNNs

Dynamic Routing Between Capsules

Information-Theory Interpretation of the Skip-Gram Negative-Sampling Objective Function

Improved Word Representation Learning with Sememes

Not All Contexts Are Created Equal:Better Word Representations with Variable Attention