Data Science - Volume 1, issue 1-2 - Journals

Data Science – Methods, infrastructure, and applications

Authors: Dumontier, Michel | Kuhn, Tobias

Article Type: Editorial

DOI: 10.3233/DS-170013

Citation: Data Science, vol. 1, no. 1-2, pp. 1-5, 2017

Conflict forecasting and its limits

Authors: Chadefaux, Thomas

Article Type: Position Paper

Abstract: Research on international conflict has mostly focused on explaining events such as the onset or termination of wars, rather than on trying to predict them. Recently, however, forecasts of political phenomena have received growing attention. Predictions of violent events, in particular, have been increasingly accurate using various methods ranging from expert knowledge to quantitative methods and formal modeling. Yet, we know little about the limits of these approaches, even though information about these limits has critical implications for both future research and policy-making. In particular, are our predictive inaccuracies due to limitations of our models, data, or assumptions, in which case improvements should occur incrementally, or are there aspects of conflicts that will always remain fundamentally unpredictable? After reviewing some of the current approaches to forecasting conflict, I suggest avenues of research that could disentangle the causes of our current predictive failures.

Keywords: Conflict, war, forecasting, tournaments, predictability

DOI: 10.3233/DS-170002

Citation: Data Science, vol. 1, no. 1-2, pp. 7-17, 2017

Knowledge-based biomedical Data Science

Authors: Hunter, Lawrence E.

Article Type: Position Paper

Abstract: Computational manipulation of knowledge is an important, and often under-appreciated, aspect of biomedical Data Science. The first Data Science initiative from the US National Institutes of Health was entitled “Big Data to Knowledge (BD2K).” The main emphasis of the more than $200M allocated to that program has been on “Big Data;” the “Knowledge” component has largely been the implicit assumption that the work will lead to new biomedical knowledge. However, there is long-standing and highly productive work in computational knowledge representation and reasoning, and computational processing of knowledge has a role in the world of Data Science. Knowledge-based biomedical Data Science involves the design and implementation of computer systems that act as if they knew about biomedicine. There are many ways in which a computational approach might act as if it knew something: for example, it might be able to answer a natural language question about a biomedical topic, or pass an exam; it might be able to use existing biomedical knowledge to rank or evaluate hypotheses; it might explain or interpret data in light of prior knowledge, either in a Bayesian or other sort of framework. These are all examples of automated reasoning that acts on computational representations of knowledge. After a brief survey of existing approaches to knowledge-based data science, this position paper argues that such research is ripe for expansion, and expanded application.

Keywords: Ontology, knowledge representation, reasoning, inference, machine learning, text mining, explanation

DOI: 10.3233/DS-170001

Citation: Data Science, vol. 1, no. 1-2, pp. 19-25, 2017

Data Science and symbolic AI: Synergies, challenges and opportunities

Authors: Hoehndorf, Robert | Queralt-Rosinach, Núria

Article Type: Position Paper

Abstract: Symbolic approaches to Artificial Intelligence (AI) represent things within a domain of knowledge through physical symbols, combine symbols into symbol expressions, and manipulate symbols and symbol expressions through inference processes. While a large part of Data Science relies on statistics and applies statistical approaches to AI, there is an increasing potential for successfully applying symbolic approaches as well. Symbolic representations and symbolic inference are close to human cognitive representations and therefore comprehensible and interpretable; they are widely used to represent data and metadata, and their specific semantic content must be taken into account for analysis of such information; and human communication largely relies on symbols, making symbolic representations a crucial part in the analysis of natural language. Here we discuss the role symbolic representations and inference can play in Data Science, highlight the research challenges from the perspective of the data scientist, and argue that symbolic methods should become a crucial component of the data scientists’ toolbox.

Keywords: Symbolic AI, machine learning, statistics, empirical science

DOI: 10.3233/DS-170004

Citation: Data Science, vol. 1, no. 1-2, pp. 27-38, 2017

The knowledge graph as the default data model for learning on heterogeneous knowledge

Authors: Wilcke, Xander | Bloem, Peter | de Boer, Victor

Article Type: Position Paper

Abstract: In modern machine learning, raw data is the preferred input for our models. Where a decade ago data scientists were still engineering features, manually picking out the details we thought salient, they now prefer the data in their raw form. As long as we can assume that all relevant and irrelevant information is present in the input data, we can design deep models that build up intermediate representations to sift out relevant features. However, these models are often domain specific and tailored to the task at hand, and therefore unsuited for learning on heterogeneous knowledge: information of different types and from different domains. If we can develop methods that operate on this form of knowledge, we can dispense with a great deal more ad-hoc feature engineering and train deep models end-to-end in many more domains. To accomplish this, we first need a data model capable of expressing heterogeneous knowledge naturally in various domains, in as usable a form as possible, and satisfying as many use cases as possible. In this position paper, we argue that the knowledge graph is a suitable candidate for this data model. We further describe current research and discuss some of the promises and challenges of this approach.

Keywords: Knowledge graphs, semantic web, machine learning, end-to-end learning, position paper

DOI: 10.3233/DS-170007

Citation: Data Science, vol. 1, no. 1-2, pp. 39-57, 2017

Stream reasoning: A survey and outlook

Authors: Dell’Aglio, Daniele | Della Valle, Emanuele | van Harmelen, Frank | Bernstein, Abraham

Article Type: Position Paper

Abstract: Stream reasoning studies the application of inference techniques to data characterised by being highly dynamic. It can find application in several settings, from Smart Cities to Industry 4.0, from Internet of Things to Social Media analytics. This year stream reasoning turns ten, and in this article we analyse its growth. In the first part, we trace the main results obtained so far, by presenting the most prominent studies. We start with an overview of the most relevant studies developed in the context of the semantic web, and then we extend the analysis to include contributions from adjacent areas, such as databases and artificial intelligence. Looking at the past is useful to prepare for the future: in the second part, we present a set of open challenges and issues that stream reasoning will face in the near future.

Keywords: Stream reasoning, stream processing

DOI: 10.3233/DS-170006

Citation: Data Science, vol. 1, no. 1-2, pp. 59-83, 2017

Maintaining intellectual diversity in data science

Authors: Mann, Richard P. | Woolley-Meza, Olivia

Article Type: Position Paper

Abstract: Data science is a young and rapidly expanding field, but one which has already experienced several waves of temporarily-ubiquitous methodological fashions. In this paper we argue that a diversity of ideas and methodologies is crucial for the long term success of the data science community. Towards the goal of a healthy, diverse ecosystem of different statistical models and approaches, we review how ideas spread in the scientific community and the role of incentives in influencing which research ideas scientists pursue. We conclude with suggestions for how universities, research funders and other actors in the data science community can help to maintain a rich, eclectic statistical environment.

Keywords: Collective intelligence, diversity, contagion networks

DOI: 10.3233/DS-170003

Citation: Data Science, vol. 1, no. 1-2, pp. 85-94, 2017

The integration of the data scientist into the team: Implications and challenges

Authors: Desai, Manisha

Article Type: Position Paper

Abstract: Modern biomedical research is complex and requires a cross section of experts collaborating using multi-, inter-, or transdisciplinary approaches to address scientific questions. Known as team science, such approaches have become so critical that they have given rise to a new field – the science of team science. In biomedical research, data scientists often play a critical role in team-based collaborations. Integration of data scientists into research teams has multiple advantages for the clinical and translational investigator as well as for the data scientist. Clinical and translational investigators benefit from having an invested dedicated collaborator who can assume principal responsibility for essential data-related activities, while the data scientist can build a career developing tools that are relevant and data-driven. Participation in team science, however, can pose challenges. One particular challenge is the ability to appropriately evaluate the data scientist’s scholarly contributions, necessary for promotion. Only a minority of academic health centers have attempted to address this challenge. In order for team science to thrive on academic campuses, leaders of institutions need to hire data science faculty for the purpose of doing team science, with novel systems in place that incentivize the data scientist’s engagement in team science and that allow for appropriate evaluation of performance. Until such systems are adopted at the institutional level, the ability to conduct team science to address modern biomedical research with its increasingly complex data needs will be compromised. Fostering team science on campuses by putting supportive systems in place will benefit not only clinical and translational investigators and data scientists, but also the larger academic institution.

Keywords: Team science, promotion criteria, collaboration

DOI: 10.3233/DS-170008

Citation: Data Science, vol. 1, no. 1-2, pp. 95-100, 2017

Cross-disciplinary higher education of data science – beyond the computer science student

Authors: Pournaras, Evangelos

Article Type: Position Paper

Abstract: The majority of economic sectors are being transformed by the abundance of data. Smart grids, smart cities, smart health, and Industry 4.0 require domain experts to have data science skills in order to fulfil their duties and meet the challenges of the digital society. Business training or replacing domain experts with computer scientists can be costly, can limit diversity in business sectors, and can lead to the sacrifice of invaluable domain knowledge. This paper illustrates experience and lessons learnt from the design and teaching of a novel cross-disciplinary data science course at a postgraduate level in a top-class university. The course design is approached from the perspectives of constructivism and transformative learning theory. Students are introduced to a guideline for a group research project they need to deliver, which is used as a pedagogical artifact for students to unfold their data science skills as well as reflect within their team on their domain and prior knowledge. In contrast to other related courses, the course content illustrated is designed to be self-contained for students of different disciplines. Without assuming certain prior programming skills, students from different disciplines are qualified to practice data science with open-source tools at all stages: data manipulation, interactive graphical analysis, plotting, machine learning and big data analytics. Quantitative and qualitative evaluation with interviews outlines invaluable lessons learnt.

Keywords: Education, data science, cross-discipline, big data, research methodology, learning, constructivism theory, transformative theory

DOI: 10.3233/DS-170005

Citation: Data Science, vol. 1, no. 1-2, pp. 101-117, 2017

Thoughtful artificial intelligence: Forging a new partnership for data science and scientific discovery

Authors: Gil, Yolanda

Article Type: Position Paper

Abstract: Artificial intelligence will play an increasingly prominent role in scientific research ecosystems, and will become indispensable as more interdisciplinary science questions are tackled. While in recent years computers have propelled science by crunching through data and leading to a data science revolution, qualitatively different scientific advances will result from advanced intelligent technologies for crunching through knowledge and ideas. We propose seven principles for developing thoughtful artificial intelligence, which will turn intelligent systems into partners for scientists. We present a personal perspective on a research agenda for thoughtful artificial intelligence, and discuss its potential for data science and scientific discovery.

Keywords: Data science, scientific discovery, thoughtful artificial intelligence

DOI: 10.3233/DS-170011

Citation: Data Science, vol. 1, no. 1-2, pp. 119-129, 2017
