Improved multi-label classification using inter-dependence structure via a generative mixture model

Ramanuja Simha; Hagit Shatkay

doi:10.3233/978-1-61499-672-9-1336

Improved multi-label classification using inter-dependence structure via a generative mixture model

Ramanuja Simha, Hagit Shatkay

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

Single-label classification associates each instance with a single label, while multi-label classification (MLC), assigns multiple labels to instances. Simple MLC systems assume that labels are independent of one another, while more complex approaches capture inter-dependencies among labels. Experiments comparing performance of MLC systems demonstrate that there is much room for improvement. Notably, when an instance is associated with multiple labels, a feature-value of the instance may depend only on a subset of these labels and thus be conditionally independent of the others given the label-subset. Current systems do not account for such conditional independence. Moreover, dependence of a feature-value on a label is likely to imply its dependence on other inter-dependent labels. Our hypothesis is that by explicitly modeling the dependence between feature values and specific subsets of inter-dependent labels, the assignment of multi-labels to instances can be done more accurately. We present a probabilistic generative model that captures dependencies among labels as well as between features and labels, by means of a Bayesian network. We introduce the concept of label dependency sets as a basis for a new mixture model that represents conditional independencies between features and labels given subsets of inter-dependent labels. Experimental results show that the performance of the system we have developed based on our model for MLC significantly improves upon results obtained by current MLC systems that are based on probabilistic models.

Original language	English (US)
Title of host publication	Frontiers in Artificial Intelligence and Applications
Editors	Gal A. Kaminka, Maria Fox, Paolo Bouquet, Eyke Hullermeier, Virginia Dignum, Frank Dignum, Frank van Harmelen
Publisher	IOS Press BV
Pages	1336-1343
Number of pages	8
ISBN (Electronic)	9781614996712
DOIs	https://doi.org/10.3233/978-1-61499-672-9-1336
State	Published - 2016
Externally published	Yes
Event	22nd European Conference on Artificial Intelligence, ECAI 2016 - The Hague, Netherlands Duration: Aug 29 2016 → Sep 2 2016

Publication series

Name	Frontiers in Artificial Intelligence and Applications
Volume	285
ISSN (Print)	0922-6389
ISSN (Electronic)	1879-8314

Conference

Conference	22nd European Conference on Artificial Intelligence, ECAI 2016
Country/Territory	Netherlands
City	The Hague
Period	8/29/16 → 9/2/16

ASJC Scopus subject areas

Artificial Intelligence

Access to Document

10.3233/978-1-61499-672-9-1336

Cite this

Simha, R., & Shatkay, H. (2016). Improved multi-label classification using inter-dependence structure via a generative mixture model. In G. A. Kaminka, M. Fox, P. Bouquet, E. Hullermeier, V. Dignum, F. Dignum, & F. van Harmelen (Eds.), Frontiers in Artificial Intelligence and Applications (pp. 1336-1343). (Frontiers in Artificial Intelligence and Applications; Vol. 285). IOS Press BV. https://doi.org/10.3233/978-1-61499-672-9-1336

Improved multi-label classification using inter-dependence structure via a generative mixture model. / Simha, Ramanuja; Shatkay, Hagit.
Frontiers in Artificial Intelligence and Applications. ed. / Gal A. Kaminka; Maria Fox; Paolo Bouquet; Eyke Hullermeier; Virginia Dignum; Frank Dignum; Frank van Harmelen. IOS Press BV, 2016. p. 1336-1343 (Frontiers in Artificial Intelligence and Applications; Vol. 285).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Simha, R & Shatkay, H 2016, Improved multi-label classification using inter-dependence structure via a generative mixture model. in GA Kaminka, M Fox, P Bouquet, E Hullermeier, V Dignum, F Dignum & F van Harmelen (eds), Frontiers in Artificial Intelligence and Applications. Frontiers in Artificial Intelligence and Applications, vol. 285, IOS Press BV, pp. 1336-1343, 22nd European Conference on Artificial Intelligence, ECAI 2016, The Hague, Netherlands, 8/29/16. https://doi.org/10.3233/978-1-61499-672-9-1336

Simha R, Shatkay H. Improved multi-label classification using inter-dependence structure via a generative mixture model. In Kaminka GA, Fox M, Bouquet P, Hullermeier E, Dignum V, Dignum F, van Harmelen F, editors, Frontiers in Artificial Intelligence and Applications. IOS Press BV. 2016. p. 1336-1343. (Frontiers in Artificial Intelligence and Applications). doi: 10.3233/978-1-61499-672-9-1336

Simha, Ramanuja ; Shatkay, Hagit. / Improved multi-label classification using inter-dependence structure via a generative mixture model. Frontiers in Artificial Intelligence and Applications. editor / Gal A. Kaminka ; Maria Fox ; Paolo Bouquet ; Eyke Hullermeier ; Virginia Dignum ; Frank Dignum ; Frank van Harmelen. IOS Press BV, 2016. pp. 1336-1343 (Frontiers in Artificial Intelligence and Applications).

@inproceedings{1cccf12357a24564ae5658c6676e577f,

title = "Improved multi-label classification using inter-dependence structure via a generative mixture model",

abstract = "Single-label classification associates each instance with a single label, while multi-label classification (MLC), assigns multiple labels to instances. Simple MLC systems assume that labels are independent of one another, while more complex approaches capture inter-dependencies among labels. Experiments comparing performance of MLC systems demonstrate that there is much room for improvement. Notably, when an instance is associated with multiple labels, a feature-value of the instance may depend only on a subset of these labels and thus be conditionally independent of the others given the label-subset. Current systems do not account for such conditional independence. Moreover, dependence of a feature-value on a label is likely to imply its dependence on other inter-dependent labels. Our hypothesis is that by explicitly modeling the dependence between feature values and specific subsets of inter-dependent labels, the assignment of multi-labels to instances can be done more accurately. We present a probabilistic generative model that captures dependencies among labels as well as between features and labels, by means of a Bayesian network. We introduce the concept of label dependency sets as a basis for a new mixture model that represents conditional independencies between features and labels given subsets of inter-dependent labels. Experimental results show that the performance of the system we have developed based on our model for MLC significantly improves upon results obtained by current MLC systems that are based on probabilistic models.",

author = "Ramanuja Simha and Hagit Shatkay",

note = "Publisher Copyright: {\textcopyright} 2016 The Authors and IOS Press.; 22nd European Conference on Artificial Intelligence, ECAI 2016 ; Conference date: 29-08-2016 Through 02-09-2016",

year = "2016",

doi = "10.3233/978-1-61499-672-9-1336",

language = "English (US)",

series = "Frontiers in Artificial Intelligence and Applications",

publisher = "IOS Press BV",

pages = "1336--1343",

editor = "Kaminka, {Gal A.} and Maria Fox and Paolo Bouquet and Eyke Hullermeier and Virginia Dignum and Frank Dignum and {van Harmelen}, Frank",

booktitle = "Frontiers in Artificial Intelligence and Applications",

address = "Netherlands",

}

TY - GEN

T1 - Improved multi-label classification using inter-dependence structure via a generative mixture model

AU - Simha, Ramanuja

AU - Shatkay, Hagit

PY - 2016

Y1 - 2016

N2 - Single-label classification associates each instance with a single label, while multi-label classification (MLC), assigns multiple labels to instances. Simple MLC systems assume that labels are independent of one another, while more complex approaches capture inter-dependencies among labels. Experiments comparing performance of MLC systems demonstrate that there is much room for improvement. Notably, when an instance is associated with multiple labels, a feature-value of the instance may depend only on a subset of these labels and thus be conditionally independent of the others given the label-subset. Current systems do not account for such conditional independence. Moreover, dependence of a feature-value on a label is likely to imply its dependence on other inter-dependent labels. Our hypothesis is that by explicitly modeling the dependence between feature values and specific subsets of inter-dependent labels, the assignment of multi-labels to instances can be done more accurately. We present a probabilistic generative model that captures dependencies among labels as well as between features and labels, by means of a Bayesian network. We introduce the concept of label dependency sets as a basis for a new mixture model that represents conditional independencies between features and labels given subsets of inter-dependent labels. Experimental results show that the performance of the system we have developed based on our model for MLC significantly improves upon results obtained by current MLC systems that are based on probabilistic models.

AB - Single-label classification associates each instance with a single label, while multi-label classification (MLC), assigns multiple labels to instances. Simple MLC systems assume that labels are independent of one another, while more complex approaches capture inter-dependencies among labels. Experiments comparing performance of MLC systems demonstrate that there is much room for improvement. Notably, when an instance is associated with multiple labels, a feature-value of the instance may depend only on a subset of these labels and thus be conditionally independent of the others given the label-subset. Current systems do not account for such conditional independence. Moreover, dependence of a feature-value on a label is likely to imply its dependence on other inter-dependent labels. Our hypothesis is that by explicitly modeling the dependence between feature values and specific subsets of inter-dependent labels, the assignment of multi-labels to instances can be done more accurately. We present a probabilistic generative model that captures dependencies among labels as well as between features and labels, by means of a Bayesian network. We introduce the concept of label dependency sets as a basis for a new mixture model that represents conditional independencies between features and labels given subsets of inter-dependent labels. Experimental results show that the performance of the system we have developed based on our model for MLC significantly improves upon results obtained by current MLC systems that are based on probabilistic models.

UR - http://www.scopus.com/inward/record.url?scp=85013074928&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85013074928&partnerID=8YFLogxK

U2 - 10.3233/978-1-61499-672-9-1336

DO - 10.3233/978-1-61499-672-9-1336

M3 - Conference contribution

AN - SCOPUS:85013074928

T3 - Frontiers in Artificial Intelligence and Applications

SP - 1336

EP - 1343

BT - Frontiers in Artificial Intelligence and Applications

A2 - Kaminka, Gal A.

A2 - Fox, Maria

A2 - Bouquet, Paolo

A2 - Hullermeier, Eyke

A2 - Dignum, Virginia

A2 - Dignum, Frank

A2 - van Harmelen, Frank

PB - IOS Press BV

T2 - 22nd European Conference on Artificial Intelligence, ECAI 2016

Y2 - 29 August 2016 through 2 September 2016

ER -

Improved multi-label classification using inter-dependence structure via a generative mixture model

Abstract

Publication series

Conference

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this