The course presents modern solutions for managing big data, that is, very large repositories of unstructured data. Starting from the requirements of modern database applications, it covers the hardware and software architectures recently proposed for managing and analyzing big data. Topics include cluster architectures, the MapReduce paradigm, cloud computing, NoSQL systems, and tools and languages for data analysis. Both theoretical and practical aspects are addressed; the technologies discussed are tried out in practical classes and in assigned projects.
Curriculum
Syllabus
- Infrastructures and programming paradigms for big data
- The Hadoop Ecosystem
- Cloud computing
- Big data processing (MapReduce, Hive, Spark)
- NoSQL systems
- Big data analytics
- Data lakes
- Systems and applications
- Business seminars
Adopted Texts
Martin J. Fowler, Pramodkumar J. Sadalage. "NoSQL Distilled: A Brief Guide to the Emerging World of Polyglot Persistence".
Teacher slides (available on the Web site of the course)
Reference Bibliography
Available on the course Web site on Moodle.
Teaching Methods
The following teaching methods and supporting tools are used to achieve the expected learning outcomes:
- lectures
- practical exercises
- seminars
- laboratories
- teamwork
- analysis of real case studies
Assessment Methods
The final evaluation is based on the development of some projects and on a one-hour written test.
- The projects are carried out in groups and consist of both solving problems assigned by the teacher and carrying out activities agreed with the teacher.
- The written test consists of exercises that verify how well students understand the concepts and can apply them in real contexts. The tests assigned in previous years are available on the course website.