About us

Hi! We are a work group coursing Cloud & Big Data on the Universidad Complutense de Madrid. We are five students in Software and Videogame Development that just started diving in the ocean of possibilities that is the Cloud Computing and Big Data world.

Objective and use of Big Data

Initially, we did a big brainstorming session about the theme of our final proyect, searching far and wide on Kaggle datasets that could be worth the exploration and research and could give a nice topic to talk about.
Finally, we stood upon great datasets that contained thousands of entries about videogames, companies, prices and much more interesting data. Our main source of information is the dataset steam.csv:

We also complimented this massive collection of entries with other datasets that helped us have a more insightful knowledge and let us provide an understandable research conclusion.

Final thoughts about the project

When we started the project, we had some difficulties due to our ignorance with the methods and technologies we had to use. Some of our members had problems with Linux, the Cloud instances or had never used Python.

We weren't really happy with the data we used either, because some of the applications from the data were not complete or had some information that gave us more problems when analyzing everything. For example, videogames without playtime or genres with confusing tag/names.

Despite this, we still managed to come with a well made project and to cover all the sections we planned to study. We are satisfied with the results, but we would have liked to offer an easier way to access our code results through this webpage, although we didn't have enough web engeneering knowledge to make some of our code functionality to work on the web.