About ABIS
All CoursesbalkjeGeneral courses » Introduction to HW & SW » Soft skills » TracksOperating systems » MVS - z/OS » Linux - UNIX » Mac OS X » iPad and iPhone iOSDatabases and middleware » Relational databases & SQL » Db2 for z/OS » Db2 for LUW » Oracle » SQL Server » MySQL & MariaDB » IMS » CICS » IBM MQ » WebSphere » Data Science, Big Data and AnalyticsApplication development » Methods and techniques » TOGAF » PRINCE2 » Agile development and Scrum » Programming languages » Internet development » Object Oriented systems » Java » Development tools » SAS » XML » SOA & web servicesSystems management » ITIL » SecuritybalkjePractical informationRegistration 
Big data in practice using Hadoop

Nowadays everybody seems to be working with "big data". Do you also want to interrogate your several data sources (click streams, social media, relational data, sensor data, ...) and are you experiencing the shortcomings of traditional data tools? Maybe you are in need of distributed data stores like HDFS and a MapReduce infrastructure like Hadoop's.

This course builds on the concepts which are set forth in the Big data architecture and infrastructure course. you will get hands-on practice on linux with Apache Hadoop: HDFS, Yarn, Pig, and Hive. You learn how to implement robust data processing with an SQL-style interface which generates MapReduce jobs. You also learn to work with the graphical tools which allow for easy follow-up of the jobs and the workflows on the distributed Hadoop cluster.

After successful completion of the course, you will have sufficient basic expertise to set up a Hadoop cluster, to import data into HDFS, and to interrogate it clevery using MapReduce.

When you want to use Hadoop with Spark, you are referred to the course Big data in practice using Spark.


No public sessions are currently scheduled. We will be pleased to set up an on-site course or to schedule an extra public session (in case of a sufficient number of candidates). Interested? Please contact ABIS.

Intended for

Whoever wants to start practising "big data": developers, data architects, and anyone who needs to work with big data technology.


Familiarity with the concepts of data stores and more specifically of "big data" is necessary; see our course Big data architecture and infrastructure. Additionally, minimal knowledge of SQL, UNIX and Java are useful. Experience with a programming language (Java, PHP, Python, Perl, C++ or C#) is a must.

Main topics

Training method

Classroom instruction, with practical examples and supported by extensive practical exercises.


2 days.

Course leader

Peter Vanroose.

Global score

4.1/5 (based on 29 evaluations)


Redelijk veel info voor de beschikbare periode (, )
Wel ok, ik denk dat de algemene uitleg veel sneller kan. Soms veel focus op details die voor mij bijna irrelevant lijken. Kan ook aan mij liggen. (, )
Bon debut pour commencer dans le Big data (, )
Goede introductie (, )
Ik vond dit een zeer goede cursus. (, )
Interessante kennismaking. Voor mij soms te veel theorie (, )
De meeste belangrijke punten zijn behandeld in de cursus. (, )
Goed om een overzicht te krijgen (, )
De cursus bevat de nodige informatie en past goed in de 2 dagen. (, )
Happy with the training even if I would spend less time on HDFS and MapReduce and more time in others components (Pig, Hive,...) (, )

Refresh this page to see other comments.

Also interesting

Enrollees for this training also took the following courses: