2015 Analytics and Big Data Summit Abstracts


Trends and Developments in Big Data

Addison Snell, Chief Executive Officer, Intersect360 Research

Abstract

Pending


Adding Omics Data to Electronic Health Record, a Paradigm Shift in Big Data Modeling, Analytics and Storage Management for Healthcare and Life Sciences Organizations

Ali Eghlima, Director of Bioinformatic, Expert BioSystems

Abstract

Illumina CEO, recently announced availability of whole genome sequencing for just under $1000. By 2020 whole genome sequencing could cost about $200. Today, utilizing these technologies, a typical research program could generate from tens of terabytes to petabytes of data for a single study. Within ten years, a large genomic research program may need to analyze many petabytes to Exabyte of data.

Adding patient’s genomic date to patient Electronic Health Record (EHR) will increase per patient dataset size from at most a few Gigabytes (today) to several terabytes. So, in a mid to large size hospital computer storage requirements, and associated computing power and network infrastructure performance will need to increase by at least three order of magnitude. Due to patient privacy, regulatory requirements, and issues related to cyber security, healthcare institute such as major hospitals are very reluctant in utilizing public cloud computing, and also, private cloud technology is not appropriate for distributed research collaboration, and large-scale interoperability across many organizations.

Current computing infrastructure of most life sciences research centers, and healthcare organizations/hospitals have not been architected/designed to handle “HUGE” Big Data analytics, which is require to manage many Petabytes to Exabyte dataset class size, especially addressing requirements with regard to research collaboration across many organizations.

Learning Objectivies

  • Review current technology, and common systems architecture used for Big Data Analytics in Health Sciences vs other industries
  • Discuss issues, challenges and potential solutions for real-time and archived data storage managements
  • Review, Data integrity/Privacy/Cyber Security concerns of major healthcare/research centers
  • Present scalable open source computing platform to manage Exabyte class datasets

Spark — Fast and Easy Processing of Big Data

Sujee Maniyam, Big Data Consultant / Trainer, ElephantScale

Abstract

Apache Spark has been gaining a lot of momentum in the Big Data world. Spark offers efficient processing of data by taking advantage of in-memory data. Hence Spark is better suited for quick iterative algorithms for machine learning, than Hadoop map-reduce framework. This talk will introduce Spark to audience. We will compare and contrast it with Hadoop and show a quick demo.


Strategies and New Technology for Long Term Preservation of Big Data

Sam Fineberg, Distinguished Technologist, HP

Abstract

Pending


-->

Reforming EU Data Protection

Eric Hibbard, CTO Security & Privacy Hitachi Data Systems; SNIA Security TWG Chair

Abstract

After reviewing the diverging data protection legislation in the EU member states, the European Commission (EC) decided that this situation would impede the free flow of data within the EU zone. The EC response was to undertake an effort to "harmonize" the data protection regulations and it started the process by proposing a new data protection framework. This proposal includes some significant changes like defining a data breach to include data destruction, adding the right to be forgotten, adopting the U.S. practice of breach notifications, and many other new elements. Another major change is a shift from a directive to a rule, which means the protections are the same for all 27 countries and includes significant financial penalties for infractions. This tutorial explores the new EU data protection legislation and highlights the elements that could have significant impacts on data handling practices.

Learning Objectivies

  • Highlight the major changes to the previous data protection directive
  • Understand the differences between “Directives” versus “Regulations”, as it pertains to the EU members
  • Learn the nature of the Reforms as well as the specific proposed changes – in both the directives and the regulations