Idiap on LinkedIn Idiap youtube channel Idiap on Twitter Idiap on Facebook
Personal tools
You are here: Home Research Resources wmil-sgd: A weighted multiple-instance learning algorithm based on stochastic gradient descent

wmil-sgd: A weighted multiple-instance learning algorithm based on stochastic gradient descent

— filed under:

This is a Python implementation of the weighted multiple-instance learning (WMIL) algorithm based on stochastic gradient descent described in [1], extending our earlier proposal [2].

The WMIL-SGD algorithm is a weakly supervised learning model, which jointly learns to focus on relevant parts of a document according to the context along with a classifier for the target categories. The model takes as input a document (bag), which consists of multiple input vectors (instances), possibly from a neural network. The model learns to compute a weighted average of these vectors by estimating the weights for each document and target categories. We have applied WMIL-SGD to multi-aspect sentiment analysis, segmentation, and summarization. We have shown that the weights predicted by WMIL-SGD match human estimates of the importance of each sentence on a dataset of audiobook reviews [3].

[1] Pappas N. & Popescu-Belis A. (2017) - Explicit Document Modeling through Weighted Multiple-Instance Learning. Journal of Artificial Intelligence Research (JAIR).

[2] Pappas N. & Popescu-Belis A. (2014) - Explaining the Stars: Weighted Multiple-Instance Learning for Aspect-Based Sentiment Analysis. Proceedings of EMNLP 2014 (Conference on Empirical Methods in Natural Language Processing), Doha, Qatar, p.455-466.

[3] Pappas N. & Popescu-Belis A. (2016) - Human versus Machine Attention in Document Classification: A Dataset with Crowdsourced Annotations. Proceedings of the EMNLP 2016 SocialNLP workshop (4th International Workshop on Natural Language Processing for Social Media), Austin, TX, p. 94-100.

Document Actions
Resource Information
Resource type: software
URL: https://github.com/nik0spapp/wmil-sgd/
Date: Mar 01, 2017
Nature: Model
Lifespan: Indefinite
Size: 6KB
Audience: Anyone
Access: Public (via git repository)
Ownership: Idiap Research Institute
Distribution: Web
License:
GNU GENERAL PUBLIC LICENSE Version 3, 29 June 2007
Contact: Nikolaos PAPPAS
+41 277 217 814
Andrei POPESCU-BELIS
+41 277 217 729