ISSN: 2380-6966

eISSN: 2380-6974

Big Data and Information Analytics (BigDIA) is an interdisciplinary quarterly journal promoting cutting-edge research, technology transfer and knowledge translation about complex data and information processing.

The journal papers will be organized quarterly and published online first. At the end of each year, there will be a hardcopy volume consisting of the four issues. Institutes subscribing to the journal have electronic access and can purchase a hard copy. Hard copies can also be sold individually.

BigDIA publishes Research articles (long and original research); Communications (short and novel research); Expository papers; Technology Transfer and Knowledge Translation reports (description of new technologies and products); and Announcements and Industrial Progress and News (announcements and even advertisements, including major conferences).

  • AIMS is a member of COPE. All AIMS journals adhere to the publication ethics and malpractice policies outlined by COPE.
  • Publishes 4 issues a year in January, April, July and October.
  • Publishes online only.
  • Archived in Portico and CLOCKSS.
  • BDIA is a publication of the American Institute of Mathematical Sciences. All rights reserved.

Note: “Most Cited” is by Cross-Ref, and “Most Downloaded” is based on available data in the new website.

First steps in the investigation of automated text annotation with pictures
J. Kent Poots and Nick Cercone
2017, 2(2) : 97-106 doi: 10.3934/bdia.2017001 +[Abstract](690) +[HTML](326) +[PDF](370.11KB)
Abstract:

We describe the investigation of automatic annotation of text with pictures, where knowledge extraction uses dependency parsing. Annotation of text with pictures, a form of knowledge visualization, can assist understanding. The problem statement is: given a corpus of images and a short passage of text, extract knowledge (or concepts), and then display that knowledge in pictures along with the text to help with understanding. A proposed solution framework includes a component to extract document concepts, a component to match document concepts with picture metadata, and a component to produce an amalgamated output of text and pictures. A proof-of-concept application based on the proposed framework provides encouraging results.

Rendering website traffic data into interactive taste graph visualizations
Ana Jofre, Lan-Xi Dong, Ha Phuong Vu, Steve Szigeti and Sara Diamond
2017, 2(2) : 107-118 doi: 10.3934/bdia.2017003 +[Abstract](1035) +[HTML](186) +[PDF](2054.58KB)
Abstract:

We present a method for converting a large corpus of website traffic data into interactive and practical taste graph visualizations. The website traffic data lists individual visitors' level of interest in specific pages across the website; it is a tripartite list consisting of anonymized visitor ID, webpage ID, and a score that quantifies interest level. Taste graph visualizations reveal psychological profiles by revealing connections between consumer tastes; for example, an individual with a taste for A may also have a taste for B. We describe here the method by which we map the web traffic data into a form that can be displayed as interactive taste graphs, and we describe design strategies for communicating the revealed information. In the context of the publishing industry, this interactive visualization is a tool that renders the large corpus of website traffic data into a form that is actionable for marketers and advertising professionals. It could equally be used as a method to personalize services in the domains of government services, education, or health and wellness.

Proportional association based roi model
Wenxue Huang, Yuanyi Pan and Lihong Zheng
2017, 2(2) : 119-125 doi: 10.3934/bdia.2017004 +[Abstract](654) +[HTML](138) +[PDF](280.31KB)
Abstract:

Based on a local-to-global proportional association measure proposed by Huang, Shi and Wang [9], and with cost and revenue information known, an association measure is proposed to maximize the expected RoI. A descriptive experiment with a synthetic data set is presented.

Big data collection and analysis for manufacturing organisations
Pankaj Sharma, David Baglee, Jaime Campos and Erkki Jantunen
2017, 2(2) : 127-139 doi: 10.3934/bdia.2017002 +[Abstract](910) +[HTML](195) +[PDF](403.1KB)
Abstract:

Data mining applications are becoming increasingly important for a wide range of manufacturing and maintenance processes. During daily operations, large amounts of data are generated. This large volume and variety of data, arriving at a greater velocity, has its own advantages and disadvantages. On the negative side, the abundance of data often impedes the ability to extract useful knowledge. In addition, the large amounts of data stored in often unconnected databases make it impractical to manually analyse them for valuable decision-making information. However, the advent of new-generation big data analytical tools has started to provide large-scale benefits for organizations. The paper examines the possible data inputs from machines, people and organizations that can be analysed for maintenance. Further, it explains the role of big data within maintenance and how, if not managed correctly, big data can create problems rather than provide solutions. The paper highlights the need for advanced mining techniques to convert data into information in an acceptable time frame, and for modern analytical tools to extract value from big datasets.

Identifying electronic gaming machine gambling personae through unsupervised session classification
Maria Gabriella Mosquera and Vlado Keselj
2017, 2(2) : 141-175 doi: 10.3934/bdia.2017015 +[Abstract](1062) +[HTML](248) +[PDF](17775.63KB)
Abstract:

The rising accessibility of gambling products, such as Electronic Gaming Machines (EGM), has increased interest in the effects of gambling; in particular, the potential for impulse control disorders, such as problem gambling. Nevertheless, empirical research on EGM gambling behaviour is scarce. In this exploratory study, we apply data mining techniques to 46,416 gambling sessions, collected in situ from 288 EGMs. Our research focused on identifying the at-risk behavioural markers of sessions to help distinguish gambling personae. Our data included measures of gambling involvement, out-of-pocket expense of sessions, amount won, and cost of gambling. This research discusses the methodology used to collect and analyze the required gambling measures, explains the criteria used for identifying valid sessions, and combines outlier mining methods to identify instances of heavily involved gambling (i.e., outliers). Our results suggest that sessions were classified as potential non-problem, potential low-risk, potential moderate-risk, and potential problem gambling sessions. Further, outlier sessions were more heavily involved in terms of gambling intensity and amount redeemed, despite having low duration times. Finally, our methods suggest that the lack of player identification does not prevent one from identifying the potential incidence of problem gambling behaviour.

An ontological account of flow-control components in BPMN process models
Xing Tan, Yilan Gu and Jimmy Xiangji Huang
2017, 2(2) : 177-189 doi: 10.3934/bdia.2017016 +[Abstract](735) +[HTML](150) +[PDF](456.36KB)
Abstract:

The Business Process Model and Notation (BPMN) has been widely adopted in recent years as one of the standard languages for visual description of business processes. BPMN, however, does not include a formal semantics, which is required for formal verification and validation of behaviors of BPMN models.

Towards bridging this gap using first-order logic, this paper presents an ontological/formal account of flow-control components in BPMN, using Situation Calculus and Petri nets. More precisely, we use SCOPE (Situation Calculus Ontology of PEtri nets), developed in our previous work, to formally describe flow-control related basic components (i.e., events, tasks, and gateways) in BPMN as SCOPE-based procedures. These components are first mapped from BPMN onto Petri nets.

Our approach differs from other major approaches for assigning semantics to BPMN (e.g., the ones applying communicating sequential processes, or abstract state machines) in the following aspects. Firstly, the approach supports direct application of automated theorem proving for checking theory consistency or verifying dynamical properties of systems. Secondly, it defines concepts through aggregation of more basic concepts in a hierarchical way, so the adoptability and extensibility of the models are presumably high. Thirdly, the Petri-net-based implementation is completely encapsulated, such that interfaces between the system and its users are defined completely within a BPMN context. Finally, the approach can readily be extended to incorporate the concept of time.

Most Cited

On balancing between optimal and proportional categorical predictions
Wenxue Huang and Yuanyi Pan
2016, 1(1) : 129-137 doi: 10.3934/bdia.2016.1.129 +[Abstract](855) +[PDF](289.1KB) Cited By(3)
Towards big data processing in clouds: An online cost-minimization approach
Weidong Bao, Wenhua Xiao, Haoran Ji, Chao Chen, Xiaomin Zhu and Jianhong Wu
2016, 1(1) : 15-29 doi: 10.3934/bdia.2016.1.15 +[Abstract](887) +[PDF](547.6KB) Cited By(1)
Older adults, frailty, and the social and behavioral determinants of health
Grace Gao, Sasank Maganti and Karen A. Monsen
2017, 2(3&4) : 1-12 doi: 10.3934/bdia.2017012 +[Abstract](866) +[HTML](663) +[PDF](1029.95KB) Cited By(1)
A review on low-rank models in data analysis
Zhouchen Lin
2016, 1(2&3) : 139-161 doi: 10.3934/bdia.2016001 +[Abstract](1138) +[PDF](946.5KB) Cited By(1)
How do I choose the right NoSQL solution? A comprehensive theoretical and experimental survey
Hamzeh Khazaei, Marios Fokaefs, Saeed Zareian, Nasim Beigi-Mohammadi, Brian Ramprasad, Mark Shtern, Purwa Gaikwad and Marin Litoiu
2016, 1(2&3) : 185-216 doi: 10.3934/bdia.2016004 +[Abstract](677) +[PDF](1687.0KB) Cited By(1)
Advanced Disaster, Emergency and Rapid Response Simulation (ADERSIM)
Jimmy Huang, Ali Asgary and Jianhong Wu
2016, 1(1) : v-v doi: 10.3934/bdia.2016.1.1v +[Abstract](822) +[PDF](92.0KB) Cited By(1)
A soft subspace clustering algorithm with log-transformed distances
Guojun Gan and Kun Chen
2016, 1(1) : 93-109 doi: 10.3934/bdia.2016.1.93 +[Abstract](967) +[PDF](387.2KB) Cited By(1)
What's the big deal about big data?
Nick Cercone, F'IEEE
2016, 1(1) : 31-79 doi: 10.3934/bdia.2016.1.31 +[Abstract](887) +[PDF](870.8KB) Cited By(1)
An evolutionary multiobjective method for low-rank and sparse matrix decomposition
Tao Wu, Yu Lei, Jiao Shi and Maoguo Gong
2017, 2(1) : 23-37 doi: 10.3934/bdia.2017006 +[Abstract](1180) +[HTML](62) +[PDF](772.1KB) Cited By(0)
An ontological account of flow-control components in BPMN process models
Xing Tan, Yilan Gu and Jimmy Xiangji Huang
2017, 2(2) : 177-189 doi: 10.3934/bdia.2017016 +[Abstract](735) +[HTML](150) +[PDF](456.36KB) Cited By(0)

Most Downloaded

Prediction models for burden of caregivers applying data mining techniques
Sunmoo Yoon, Maria Patrao, Debbie Schauer and Jose Gutierrez
2017, 2(5) : 1-9 doi: 10.3934/bdia.2017014 +[Abstract](946) +[HTML](533) +[PDF](304.31KB) PDF Downloads(43)
Older adults, frailty, and the social and behavioral determinants of health
Grace Gao, Sasank Maganti and Karen A. Monsen
2017, 2(3&4) : 1-12 doi: 10.3934/bdia.2017012 +[Abstract](866) +[HTML](663) +[PDF](1029.95KB) PDF Downloads(42)
A category-based probabilistic approach to feature selection
Jianguo Dai, Wenxue Huang and Yuanyi Pan
2017, 2(5) : 1-8 doi: 10.3934/bdia.2017020 +[Abstract](115) +[HTML](85) +[PDF](272.52KB) PDF Downloads(30)
A novel approach using incremental under sampling for data stream mining
Anupama N and Sudarson Jena
2017, 2(5) : 1-13 doi: 10.3934/bdia.2017017 +[Abstract](524) +[HTML](400) +[PDF](466.28KB) PDF Downloads(30)
An evolutionary multiobjective method for low-rank and sparse matrix decomposition
Tao Wu, Yu Lei, Jiao Shi and Maoguo Gong
2017, 2(1) : 23-37 doi: 10.3934/bdia.2017006 +[Abstract](1180) +[HTML](62) +[PDF](772.1KB) PDF Downloads(26)
First steps in the investigation of automated text annotation with pictures
J. Kent Poots and Nick Cercone
2017, 2(2) : 97-106 doi: 10.3934/bdia.2017001 +[Abstract](690) +[HTML](326) +[PDF](370.11KB) PDF Downloads(22)
Fuzzy temporal meta-clustering of financial trading volatility patterns
Pawan Lingras, Farhana Haider and Matt Triff
2017, 2(5) : 1-20 doi: 10.3934/bdia.2017018 +[Abstract](507) +[HTML](353) +[PDF](725.92KB) PDF Downloads(20)
What can we learn about the Middle East Respiratory Syndrome (MERS) outbreak from tweets?
Sunmoo Yoon, Da Kuang, Peter Broadwell, Haeyoung Lee and Michelle Odlum
2017, 2(5) : 1-5 doi: 10.3934/bdia.2017013 +[Abstract](684) +[HTML](440) +[PDF](3155.87KB) PDF Downloads(19)
Modeling daily guest count prediction
Fok Ricky, Lasek Agnieszka, Li Jiye and An Aijun
2016, 1(4) : 299-308 doi: 10.3934/bdia.2016012 +[Abstract](858) +[HTML](366) +[PDF](644.45KB) PDF Downloads(17)
Big data collection and analysis for manufacturing organisations
Pankaj Sharma, David Baglee, Jaime Campos and Erkki Jantunen
2017, 2(2) : 127-139 doi: 10.3934/bdia.2017002 +[Abstract](910) +[HTML](195) +[PDF](403.1KB) PDF Downloads(16)
