JSWDb

The aim of this project is to extract information about movies and TV shows from IMDb and store it in your own database for further queries. This information is extracted through web scraping, driven by a batch process.

The project follows the principles of Hexagonal Architecture (aka Ports & Adapters), as described by Alistair Cockburn. It is composed of the following modules:

I used the following technologies to build this project:

The project is built with Maven using a multi-module approach: a parent POM handles dependency management, and each module has its own child POM.
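To make the Ports & Adapters idea more concrete, here is a minimal Java sketch of how such a separation might look. It is not taken from this project's code: every name in it (`Movie`, `MovieSource`, `MovieStore`, `ImportMovies`, `InMemoryMovieStore`) is hypothetical, the data source is a stub instead of a real scraper, and it assumes Java 16 or later for `record`. In a multi-module build, the ports and the application service would typically live in a core module, with each adapter in a module of its own.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.Optional;

public class HexagonalSketch {

    // Domain object: knows nothing about scraping or persistence.
    record Movie(String id, String title) {}

    // Port (hypothetical): where movie data comes from, e.g. a scraper adapter.
    interface MovieSource {
        List<Movie> fetchAll();
    }

    // Port (hypothetical): where movie data is stored, e.g. a database adapter.
    interface MovieStore {
        void save(Movie movie);
        Optional<Movie> findById(String id);
    }

    // Application service: depends only on the ports, never on a concrete adapter.
    static final class ImportMovies {
        private final MovieSource source;
        private final MovieStore store;

        ImportMovies(MovieSource source, MovieStore store) {
            this.source = source;
            this.store = store;
        }

        // Copies every movie from the source into the store and returns the count.
        int run() {
            int imported = 0;
            for (Movie movie : source.fetchAll()) {
                store.save(movie);
                imported++;
            }
            return imported;
        }
    }

    // Adapter: an in-memory stand-in for a real database module.
    static final class InMemoryMovieStore implements MovieStore {
        private final Map<String, Movie> byId = new LinkedHashMap<>();

        @Override
        public void save(Movie movie) {
            byId.put(movie.id(), movie);
        }

        @Override
        public Optional<Movie> findById(String id) {
            return Optional.ofNullable(byId.get(id));
        }
    }

    public static void main(String[] args) {
        // Stub adapter with placeholder data; a real one would scrape pages.
        MovieSource stubSource = () -> List.of(
                new Movie("id-1", "First Example"),
                new Movie("id-2", "Second Example"));
        MovieStore store = new InMemoryMovieStore();

        int imported = new ImportMovies(stubSource, store).run();
        System.out.println("Imported " + imported + " movies");
    }
}
```

Because `ImportMovies` only sees the two interfaces, the stub source or the in-memory store can be swapped for a real scraper or database adapter without touching the service.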

The image below is a screenshot of the UI, showing the progress of the scraping activity:

Job Execution