GridKa School 2013: Big Data, Clouds and Grids

Name: GridKa School 2013: Big Data, Clouds and Grids
Start: 2013-08-26T12:00:00+02:00
End: 2013-08-30T18:00:00+02:00
Location: KIT Campus North, FTU

Aug 26 – 30, 2013

KIT Campus North, FTU

Europe/Berlin timezone

Hadoop in Complex Systems Research ( A Review on Tools, Best Practices & Applications )

Aug 27, 2013, 9:40 AM

40m

Aula (KIT Campus North, FTU)

Aula

KIT Campus North, FTU

Big Data and Large Storage Systems Plenary talks

Mirko Kämpf (Cloudera)

A Hadoop cluster is the tool of choice for many large scale analytics applications and a large variety of commercial tools is available for Data Warehouses and for typical SQL-like applications. But how to deal with networks and time series? How to collect data for complex systems studies and what are good practices for working with libraries like Mahout and Giraph? The sample use case deals with a data set from Wikipedia to illustrate one can combine multiple public data sources with own personal data collections, e.g. from Twitter, intranet servers or even personal mailboxes. Efficient approaches for time series (pre)-processing and time dependent graph analysis will be presented.

Mirko Kämpf (Cloudera)

Slides

2-Karlsruhe_GridKA_2013-08-27.pdf

GridKa School 2013: Big Data, Clouds and Grids

Hadoop in Complex Systems Research ( A Review on Tools, Best Practices & Applications )

Aula

KIT Campus North, FTU

Speaker

Description

Primary author

Presentation materials

Choose timezone

GridKa School 2013: Big Data, Clouds and Grids

Speaker

Description

Primary author

Presentation materials