Data analysis on big data applications with small samples and incomplete information

Soizic Linford, Benjamin Bogdanovic, Kuo Ming Chao, Sladana Jankovic, Vladislav Maras, Mirjana Bugarinovic, Ilias Trochidis

Research output: Chapter in Book/Report/Conference proceedingConference proceeding

Abstract

The EU and other public organizations at different levels of national and local government across the world have funded and invested in numerous research and development projects on big data transport applications over last few years. The mid and long term effectiveness of these applications is very difficult to measure, and the benefits and usability of these applications are not easy to calculate. NOESIS, funded under EU H2020 program, aims to design a decision supported tool by gathering and analyzing these applications as use cases to formulate sufficient knowledge for policy makers to make informed decisions for their big data transport applications. The challenges in this work are associated with a small number of samples, with incomplete information, but having a good size of features that need to be analyzed to make a confident enough recommendation. This paper reports various statistical and machine learning approaches used to address these challenges and their results.

Original languageEnglish
Title of host publicationProceedings of the 2019 IEEE 23rd International Conference on Computer Supported Cooperative Work in Design, CSCWD 2019
EditorsWeiming Shen, Hugo Paredes, Junzhou Luo, Jean-Paul Barthes
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages146-151
Number of pages6
ISBN (Electronic)9781728103501, 9781728103495
DOIs
Publication statusPublished - 8 Aug 2019
Event23rd IEEE International Conference on Computer Supported Cooperative Work in Design, CSCWD 2019 - Porto, Portugal
Duration: 6 May 20198 May 2019

Publication series

NameProceedings of the 2019 IEEE 23rd International Conference on Computer Supported Cooperative Work in Design, CSCWD 2019

Conference

Conference23rd IEEE International Conference on Computer Supported Cooperative Work in Design, CSCWD 2019
CountryPortugal
CityPorto
Period6/05/198/05/19

Fingerprint

Learning systems
Small sample
Incomplete information
Big data
Development projects
Politicians
Local government
Machine learning
Public organizations
Statistical learning
Usability

Keywords

  • Big data
  • Machine learning
  • Multivariate regression
  • Random forest

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Computer Science Applications
  • Human-Computer Interaction
  • Information Systems
  • Information Systems and Management

Cite this

Linford, S., Bogdanovic, B., Chao, K. M., Jankovic, S., Maras, V., Bugarinovic, M., & Trochidis, I. (2019). Data analysis on big data applications with small samples and incomplete information. In W. Shen, H. Paredes, J. Luo, & J-P. Barthes (Eds.), Proceedings of the 2019 IEEE 23rd International Conference on Computer Supported Cooperative Work in Design, CSCWD 2019 (pp. 146-151). [8791927] (Proceedings of the 2019 IEEE 23rd International Conference on Computer Supported Cooperative Work in Design, CSCWD 2019). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/CSCWD.2019.8791927

Data analysis on big data applications with small samples and incomplete information. / Linford, Soizic; Bogdanovic, Benjamin; Chao, Kuo Ming; Jankovic, Sladana; Maras, Vladislav; Bugarinovic, Mirjana; Trochidis, Ilias.

Proceedings of the 2019 IEEE 23rd International Conference on Computer Supported Cooperative Work in Design, CSCWD 2019. ed. / Weiming Shen; Hugo Paredes; Junzhou Luo; Jean-Paul Barthes. Institute of Electrical and Electronics Engineers Inc., 2019. p. 146-151 8791927 (Proceedings of the 2019 IEEE 23rd International Conference on Computer Supported Cooperative Work in Design, CSCWD 2019).

Research output: Chapter in Book/Report/Conference proceedingConference proceeding

Linford, S, Bogdanovic, B, Chao, KM, Jankovic, S, Maras, V, Bugarinovic, M & Trochidis, I 2019, Data analysis on big data applications with small samples and incomplete information. in W Shen, H Paredes, J Luo & J-P Barthes (eds), Proceedings of the 2019 IEEE 23rd International Conference on Computer Supported Cooperative Work in Design, CSCWD 2019., 8791927, Proceedings of the 2019 IEEE 23rd International Conference on Computer Supported Cooperative Work in Design, CSCWD 2019, Institute of Electrical and Electronics Engineers Inc., pp. 146-151, 23rd IEEE International Conference on Computer Supported Cooperative Work in Design, CSCWD 2019, Porto, Portugal, 6/05/19. https://doi.org/10.1109/CSCWD.2019.8791927
Linford S, Bogdanovic B, Chao KM, Jankovic S, Maras V, Bugarinovic M et al. Data analysis on big data applications with small samples and incomplete information. In Shen W, Paredes H, Luo J, Barthes J-P, editors, Proceedings of the 2019 IEEE 23rd International Conference on Computer Supported Cooperative Work in Design, CSCWD 2019. Institute of Electrical and Electronics Engineers Inc. 2019. p. 146-151. 8791927. (Proceedings of the 2019 IEEE 23rd International Conference on Computer Supported Cooperative Work in Design, CSCWD 2019). https://doi.org/10.1109/CSCWD.2019.8791927
Linford, Soizic ; Bogdanovic, Benjamin ; Chao, Kuo Ming ; Jankovic, Sladana ; Maras, Vladislav ; Bugarinovic, Mirjana ; Trochidis, Ilias. / Data analysis on big data applications with small samples and incomplete information. Proceedings of the 2019 IEEE 23rd International Conference on Computer Supported Cooperative Work in Design, CSCWD 2019. editor / Weiming Shen ; Hugo Paredes ; Junzhou Luo ; Jean-Paul Barthes. Institute of Electrical and Electronics Engineers Inc., 2019. pp. 146-151 (Proceedings of the 2019 IEEE 23rd International Conference on Computer Supported Cooperative Work in Design, CSCWD 2019).
@inproceedings{ed5e3a7855eb4820b58789d8a6356399,
title = "Data analysis on big data applications with small samples and incomplete information",
abstract = "The EU and other public organizations at different levels of national and local government across the world have funded and invested in numerous research and development projects on big data transport applications over last few years. The mid and long term effectiveness of these applications is very difficult to measure, and the benefits and usability of these applications are not easy to calculate. NOESIS, funded under EU H2020 program, aims to design a decision supported tool by gathering and analyzing these applications as use cases to formulate sufficient knowledge for policy makers to make informed decisions for their big data transport applications. The challenges in this work are associated with a small number of samples, with incomplete information, but having a good size of features that need to be analyzed to make a confident enough recommendation. This paper reports various statistical and machine learning approaches used to address these challenges and their results.",
keywords = "Big data, Machine learning, Multivariate regression, Random forest",
author = "Soizic Linford and Benjamin Bogdanovic and Chao, {Kuo Ming} and Sladana Jankovic and Vladislav Maras and Mirjana Bugarinovic and Ilias Trochidis",
year = "2019",
month = "8",
day = "8",
doi = "10.1109/CSCWD.2019.8791927",
language = "English",
series = "Proceedings of the 2019 IEEE 23rd International Conference on Computer Supported Cooperative Work in Design, CSCWD 2019",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
pages = "146--151",
editor = "Weiming Shen and Hugo Paredes and Junzhou Luo and Jean-Paul Barthes",
booktitle = "Proceedings of the 2019 IEEE 23rd International Conference on Computer Supported Cooperative Work in Design, CSCWD 2019",
address = "United States",

}

TY - GEN

T1 - Data analysis on big data applications with small samples and incomplete information

AU - Linford, Soizic

AU - Bogdanovic, Benjamin

AU - Chao, Kuo Ming

AU - Jankovic, Sladana

AU - Maras, Vladislav

AU - Bugarinovic, Mirjana

AU - Trochidis, Ilias

PY - 2019/8/8

Y1 - 2019/8/8

N2 - The EU and other public organizations at different levels of national and local government across the world have funded and invested in numerous research and development projects on big data transport applications over last few years. The mid and long term effectiveness of these applications is very difficult to measure, and the benefits and usability of these applications are not easy to calculate. NOESIS, funded under EU H2020 program, aims to design a decision supported tool by gathering and analyzing these applications as use cases to formulate sufficient knowledge for policy makers to make informed decisions for their big data transport applications. The challenges in this work are associated with a small number of samples, with incomplete information, but having a good size of features that need to be analyzed to make a confident enough recommendation. This paper reports various statistical and machine learning approaches used to address these challenges and their results.

AB - The EU and other public organizations at different levels of national and local government across the world have funded and invested in numerous research and development projects on big data transport applications over last few years. The mid and long term effectiveness of these applications is very difficult to measure, and the benefits and usability of these applications are not easy to calculate. NOESIS, funded under EU H2020 program, aims to design a decision supported tool by gathering and analyzing these applications as use cases to formulate sufficient knowledge for policy makers to make informed decisions for their big data transport applications. The challenges in this work are associated with a small number of samples, with incomplete information, but having a good size of features that need to be analyzed to make a confident enough recommendation. This paper reports various statistical and machine learning approaches used to address these challenges and their results.

KW - Big data

KW - Machine learning

KW - Multivariate regression

KW - Random forest

UR - http://www.scopus.com/inward/record.url?scp=85071429149&partnerID=8YFLogxK

U2 - 10.1109/CSCWD.2019.8791927

DO - 10.1109/CSCWD.2019.8791927

M3 - Conference proceeding

T3 - Proceedings of the 2019 IEEE 23rd International Conference on Computer Supported Cooperative Work in Design, CSCWD 2019

SP - 146

EP - 151

BT - Proceedings of the 2019 IEEE 23rd International Conference on Computer Supported Cooperative Work in Design, CSCWD 2019

A2 - Shen, Weiming

A2 - Paredes, Hugo

A2 - Luo, Junzhou

A2 - Barthes, Jean-Paul

PB - Institute of Electrical and Electronics Engineers Inc.

ER -