Prof.
Achim Streit (KIT-SCC)
9/1/14, 2:00 PM
Prof.
Richard Frackowiak (University of Lausanne)
9/1/14, 4:00 PM
Big Data and Storage Systems
We now know that a single gene mutation may present with multiple phenotypes, and vice versa, that a range of genetic abnormalities may cause a single phenotype. These observations lead to the conclusion that a deeper understanding is needed of the way changes at one spatial or temporal level of organisation (e.g., genetic, proteomic or metabolic) integrate and translate into others,...
Vincent Brillault (CERN)
9/1/14, 5:00 PM
In a computing environment in constant evolution, the security management of our systems needs to adapt: cyber-criminals use new attack angles, new technologies and architectures are introduced, old security models are weakened, etc.
This presentation will cover such recent evolutions from a security point of view and discuss new or future security challenges.
Dr
Anthony Brew (IBM)
9/2/14, 9:40 AM
Cloud Computing
In today’s smarter planet whether it’s smart meters in an electric grid,
escalators and security cameras in office buildings, signals and switches
from railroad networks or Wi-Fi in airplanes or the software systems that
support them, our world is filled with devices that are instrumented and
interconnected.
There was a time when a person walking through a building and checking
meters...
Kathrin Spreyer (inovex GmbH)
9/2/14, 10:50 AM
We will present two real-world data warehousing projects we solved using Hadoop. Both projects resulted in hybrid data warehouses, with Hadoop in the backend and a relational database as the interface for both BI tools and business users. We describe the architecture as well as the data sources and data volume involved.
Dr
Massimo Torquati (University of Pisa)
9/2/14, 11:40 AM
FastFlow is an open-source C++ research framework to support the development of multi-threaded applications in modern multi/many-core heterogeneous platforms.
The framework provides well-known stream-based algorithm skeleton constructs such as pipeline, task-farm and loop that are used to build more complex and powerful patterns: parallel_for, map, reduce, macro data-flow interpreter,...
Dr
Benedikt Hegner (CERN)
9/3/14, 9:40 AM
Michael Gienger (University of Stuttgart)
9/3/14, 10:50 AM
With the advent of cloud computing, flexible and scalable services have been provided with the ambition to utilize bare metal resources in a more efficient way. The base technology for cloud computing is represented by virtualization; hence servers can contain several virtualized operating systems in a single physical box. As a small example, most of the servers offering web services are...
Christian Dornacher (HITACHI DATA SYSTEMS GmbH)
9/4/14, 9:00 AM
Big Data Analytics: Strategy and Use-Cases
The presentation by Christian Dornacher covers Hitachi’s strategy for Big Data Analytics solutions based on existing know-how from solutions like predictive maintenance and log-analytics. It also shows different customer use-cases and how these customers plan to get better insight into their data.
About the presenter
Christian Dornacher has...
Dr
Jürgen Hagedorn (SAP)
9/4/14, 9:40 AM
Karl Fürlinger (University of Munich)
9/5/14, 9:00 AM
The two most common approaches for parallel programming are message
passing (for example using MPI, the message passing interface) and
threading (for example using OpenMP or Pthreads). Threading is generally
considered an easier and more straightforward solution for parallel
programming but it can generally only be used on a single shared memory
node. MPI, on the other hand, scales...
Dr
Fabrizio Gagliardi (University of Catalonia)
9/5/14, 10:50 AM
The talk will discuss the current transformation in the computing landscape. The advent of virtualization has made possible highly scalable and affordable distributed computing systems such as those offered by Cloud providers, public or private. This poses new challenges and problems to do with latency in accessing the data, SLAs, privacy and security issues. At the same time the explosion of...
Dr
Graeme Stewart (CERN)
Modern Programming
In this course we will introduce how to program for concurrency in C++, taking advantage of modern CPUs' ability to run multi-threaded programs on different CPU cores. Firstly, we will explore the new concurrency features of C++11 itself, which will also serve as a general introduction to multi-threaded programming. Students will learn the basics of asynchronous execution, thread spawning,...
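As a minimal sketch of two of the basics named in this abstract, asynchronous execution and thread spawning, the following uses only the C++11 standard library. The function names `sum_async` and `squares_with_threads` are hypothetical, chosen for illustration, and not taken from the course material.

```cpp
#include <cassert>
#include <future>
#include <numeric>
#include <thread>
#include <vector>

// Asynchronous execution: sum a vector in a background task and
// hand the result back through a std::future.
long sum_async(const std::vector<int>& values) {
    // std::launch::async requests execution on a separate thread.
    std::future<long> result = std::async(std::launch::async, [&values] {
        return std::accumulate(values.begin(), values.end(), 0L);
    });
    return result.get();  // blocks until the task has finished
}

// Thread spawning: start one thread per element. Each thread writes
// only to its own slot of `out`, so no locking is required.
std::vector<int> squares_with_threads(int count) {
    assert(count >= 0);
    std::vector<int> out(count);
    std::vector<std::thread> workers;
    for (int i = 0; i < count; ++i) {
        workers.emplace_back([&out, i] { out[i] = i * i; });
    }
    for (auto& w : workers) w.join();  // wait for every thread to finish
    return out;
}
```

Calling `sum_async({1, 2, 3, 4})` returns 10, and `squares_with_threads(4)` returns the squares of 0 to 3. Depending on the toolchain, the code may need to be built with a threading flag such as `-pthread`.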
Ben Dylan Jones (CERN),
Sven Sternberger (DESY), Dr
Yves Kemp (DESY)
Cloud Computing
Puppet is a configuration management tool adopted by many institutions of different sizes in academia and industry. Puppet can be used to configure many different operating systems and applications. Puppet integrates well with other tools, e.g. Foreman, MCollective, ...
The workshop will feature a hands-on tutorial on Puppet allowing users to write simple manifests themselves and manage them...
Mr
Felice Pantaleo (University of Pisa)
Modern Programming
While the computing community is racing to build tools and libraries to
ease the use of heterogeneous parallel computing systems, effective and
confident use of these systems will always require knowledge about the
low-level programming interfaces in these systems.
This workshop is designed to introduce the CUDA programming language,
through examples and hands-on exercises so as to...
Cesare Delle Fratte (Rechenzentrum Garching (RZG)),
Christian Bernardt (DESY),
Christoph Anton Mitterer (University of Munich),
Luca Mazzaferro (Rechenzentrum Garching (RZG)),
Oleg Tsigenov (Aachen)
Big Data and Storage Systems
dCache is one of the most widely used storage solutions in the WLCG, with over 94 PB of storage distributed worldwide across more than 77 sites. Depending on the Persistency Model, dCache provides methods for exchanging data with backend (tertiary) Storage Systems as well as space management, pool attraction, dataset replication, hot spot determination and recovery from disk or node failures. Besides HEP...
Kathrin Spreyer (inovex GmbH)
Big Data and Storage Systems
In the last couple of years Hadoop established itself as the de facto standard for dealing with large and very large datasets. However, Hadoop does introduce quite a lot of challenges for developers with a background of classical data analytics. One example is handling raw data (e.g., logfiles) which works quite differently in Hadoop than in classical, data warehouse focused architectures....
Mirko Kämpf (Cloudera)
Big Data and Storage Systems
The use of Apache Hadoop in large-scale data analysis projects is on its way to becoming mainstream. But what are the required skills, and how do I start an Apache Hadoop project? The workshop shows and compares several aspects that should be considered at the beginning of large projects. How do I start with a POC, and how does "scale out" work? What data is stored how and how do I access...
Dr
Kenji Takeda (Microsoft)
Cloud Computing
Microsoft Azure is a general, open, and flexible global cloud platform supporting any language, tool, or framework - including Linux, Java, Python, and other non-Microsoft technologies. It is ideally suited to researchers’ needs across disciplines. The workshop is intended specifically for active scientists who can code, who will soon code, or are interested in coding in a modern computing...
Dr
Marek Szuba (KIT),
Parinaz Ameri (KIT)
Big Data and Storage Systems
This session is an introduction to a particular NoSQL database, MongoDB. MongoDB is an open-source database with a document-oriented storage approach. Since it doesn’t enforce any schema on data and because of its good performance, Mongo is nowadays widely used, especially where unstructured data storage is needed. In addition, Mongo scales well and even provides partitioning over a cluster of...
Antonio Messina,
Tyanko Aleksiev (S3IT, University of Zurich)
Cloud Computing
OpenStack is currently one of the most rapidly evolving open IaaS solutions available. Every new release comes with a huge set of new features. It can be hard to keep pace with such changes. Starting from scratch also proves difficult due to the complexity of the several components interacting with each other but also due to the lack of exhaustive documentation. The proposed training targets system...
Dr
Massimo Torquati (University of Pisa)
Modern Programming
During this tutorial session, the participants will learn how to build applications structured as combinations of stream-based parallel patterns such as pipeline, task-farm and loop. Then more high-level patterns will be introduced such as parallel_for, map and reduce, and we will see how to mix stream and data-parallel patterns to build simple (and not so simple) applications....
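To illustrate the general idea behind the map and reduce patterns mentioned in this abstract, here is a minimal sketch written with plain C++11 threads. It deliberately does not use the FastFlow API; `map_reduce_sum` and `square` are hypothetical helpers introduced only for this example, and the chunking scheme is one simple assumption among many possible ones.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <thread>
#include <vector>

// Map every element with f, then reduce the mapped values with +.
// Hypothetical helper for illustration; not part of FastFlow.
template <typename F>
long map_reduce_sum(const std::vector<int>& data, F f, std::size_t num_workers) {
    assert(num_workers > 0);
    std::vector<long> partial(num_workers, 0);
    std::vector<std::thread> workers;
    // Ceiling division so that every element belongs to some chunk.
    const std::size_t chunk = (data.size() + num_workers - 1) / num_workers;

    for (std::size_t w = 0; w < num_workers; ++w) {
        const std::size_t begin = std::min(w * chunk, data.size());
        const std::size_t end = std::min(begin + chunk, data.size());
        // Map phase plus local reduction: each worker owns partial[w].
        workers.emplace_back([&data, &partial, f, w, begin, end] {
            long local = 0;
            for (std::size_t i = begin; i < end; ++i) local += f(data[i]);
            partial[w] = local;
        });
    }
    for (auto& t : workers) t.join();

    // Final reduce phase: combine the per-worker partial sums sequentially.
    long total = 0;
    for (long p : partial) total += p;
    return total;
}

// Example map function: square one element.
long square(int x) { return static_cast<long>(x) * x; }
```

For the input `{1, 2, 3, 4, 5}` and the `square` function, `map_reduce_sum` returns 55 regardless of the number of workers. A pattern library would typically hide the chunking and joining behind a single construct; the sketch spells them out to show what such a construct has to do.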
Mario Lassnig (CERN)
Big Data and Storage Systems
Throughout the course, the students will implement a full database application with safe and efficient methods, based on the concepts learned. Additionally, where necessary, pointers to the NoSQL/non-relational database sessions with MongoDB and Hadoop are given. A basic understanding of Linux and programming (at least C or Python) is required for this session.
The agenda is as follows:
Part 1:...
Prof.
Frank Kirchner (University of Bremen)
Robotics & Artificial Intelligence
In recent years robotics has also gained a lot of interest in the area of artificial intelligence. While systems have long been used as tools to implement classical AI approaches in areas such as object recognition, environment representation, and path and motion planning, researchers are now beginning to understand that the system (robot) itself is...