TY - GEN
T1 - Shaping datasets
T2 - 23rd IEEE International Conference on Image Processing, ICIP 2016
AU - Vonikakis, Vassilios
AU - Subramanian, Ramanathan
AU - Winkler, Stefan
N1 - Publisher Copyright: © 2016 IEEE.
PY - 2016/8/3
Y1 - 2016/8/3
N2 - This paper presents a method for dataset manipulation based on Mixed Integer Linear Programming (MILP). The proposed optimization can narrow down a dataset to a particular size, while enforcing specific distributions across different dimensions. It essentially leverages the redundancies of an initial dataset in order to generate more compact versions of it, with a specific target distribution across each dimension. If the desired target distribution is uniform, then the effect is balancing: all values across all different dimensions are equally represented. Other types of target distributions can also be specified, depending on the nature of the problem. The proposed approach may be used in machine learning, for shaping training and testing datasets, or in crowdsourcing, for preparing datasets of a manageable size.
KW - Balancing
KW - Crowdsourcing
KW - Datasets
KW - Mixed Integer Linear Programming (MILP)
UR - http://www.scopus.com/inward/record.url?scp=85006746755&partnerID=8YFLogxK
U2 - 10.1109/ICIP.2016.7533061
DO - 10.1109/ICIP.2016.7533061
M3 - Conference contribution
AN - SCOPUS:85006746755
T3 - Proceedings - International Conference on Image Processing, ICIP
SP - 3753
EP - 3757
BT - 2016 IEEE International Conference on Image Processing, ICIP 2016 - Proceedings
A2 - Pereira, Fernando
A2 - Sharma, Gaurav
PB - IEEE, Institute of Electrical and Electronics Engineers
CY - United States
Y2 - 25 September 2016 through 28 September 2016
ER -