Big Data Processing using Hadoop (DA 512)

2021 Spring
Faculty of Engineering and Natural Sciences
Data Analytics (DA)
3
6
Ahmet Demirelli ahmetdemirelli@sabanciuniv.edu
Doctoral, Master
--

CONTENT

This course provides the essential background for developing programs that work with the Hadoop Distributed File System (HDFS). It also shows students the limitations of traditional programming techniques and how Hadoop addresses them. After learning the basics of a Hadoop cluster and the Hadoop ecosystem, students will learn to write programs using the MapReduce framework and to run them on a Hadoop cluster. The course also includes an introduction to Pig and Hive.
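
As an informal illustration of the MapReduce programming model mentioned above, the following sketch counts words in plain Python. It does not use Hadoop or any Hadoop API; the function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) and the sample input are hypothetical and chosen only for this example. On a real cluster the framework distributes the map and reduce steps across machines and performs the grouping step itself.

```python
from collections import defaultdict


def map_phase(line):
    """Emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Group values by key, mimicking the step between map and reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Sum the counts collected for one word."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle and reduce in sequence on an in-memory list of lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())


if __name__ == "__main__":
    # Hypothetical sample input, for illustration only.
    print(word_count(["big data on Hadoop", "Hadoop stores big files"]))
```

The map step looks at one line at a time and the reduce step at one key at a time, which is what lets a framework such as Hadoop run many of them in parallel.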

PROGRAMME OUTCOMES


1. Develop the ability to use critical, analytical, and reflective thinking and reasoning.

2. Reflect on social and ethical responsibilities in his/her professional life.

3. Gain experience and confidence in the dissemination of project/research outputs.

4. Work responsibly and creatively as an individual or as a member or leader of a team and in multidisciplinary environments.

5. Communicate effectively by oral, written, graphical and technological means and have competency in English.

6. Independently reach and acquire information, and develop an appreciation of the need for continuous learning and updating.


1. Design and model engineering systems and processes and solve engineering problems with an innovative approach.

2. Establish experimental setups, conduct experiments and/or simulations.

3. Analytically acquire and interpret data.


1. Comprehend the conceptual foundations of analytical methods and techniques within the scope of business analytics.

2. Acquire theoretical and practical knowledge on applied information systems by developing fundamental programming skills.

3. Improve decision making by turning high-volume data into useful information and integrating data analysis tools.

4. Turn high-volume data into useful information by using quantitative models and by understanding and managing data analysis techniques; communicate and visualize the results for business use.

5. Understand the concepts of data quality, data integrity and data accuracy, as well as occupational ethics regarding data privacy and intellectual property.