What’s on

To register your event e-mail info@mediateam.ie

  • Sun 08 May 2016

    Scaling Up Genomics with Spark

    6pm

    Bank of Ireland, 1 Grand Canal Square, Dublin 2


    This meeting of the Hadoop User Group is a Data Science v Big Data Stack themed night, with Apache Spark taking a leading role alongside great data analytics. It is a must-do event for Big Data practitioners and enthusiasts alike.

    Agenda:

    Scaling Up Genomics with Spark by Sean Owen, director of data science, Cloudera
    It’s amazing that our genome so completely and uniquely encodes each of us with a simple four-base code, like a file. More amazingly, we’re so similar that we can build a reference map of human genomes and reason about commonalities. Genomics has taken off in the last two decades, driven largely by advances in computing; the work of mapping the genome is incredibly data- and compute-intensive. This talk will briefly introduce the problem of genomics and existing home-grown efforts to bring ‘Big Data’ technology to solve it. It will compare these with the separate rise of technologies like Apache Hadoop and Spark, and show how these ideas are helping genomics scale up even further.

    Understanding Your Customers Using Public Data by Michael Crawford, founder, Applied AI
    A data science use case! We were involved in a project to model the quality of a large life insurer’s customer base and wanted to see if socio-economic factors were a useful predictor. Experian provide this type of information, but it was prohibitively expensive to use for our purposes (200k+ customers). We took a look at the Irish census data and reckoned we could have a crack at doing what Experian do ourselves. We also thought that it would be fun and that we’d probably learn new stuff along the way.

    For more information visit http://www.meetup.com/hadoop-user-group-ireland/events/230464912/.



TechCentral.ie