ETL-Streaming-Pipelines-Kafka

Creating Streaming Data Pipelines using Kafka

Data Pipeline architecture:

[Toll Plaza] --> Kafka --> [Kafka Consumer (Python)] --> MySQL

Description:

Aim is to de-congest the national highways by analyzing the road traffic data from different toll plazas. As the vehicle passes the toll plaza, the vehicle’s data like vehicle_id,vehicle_type,toll_plaza_id and timestamp are streamed to Kafka. In this project we created a data pipe line that collects the streaming data and loads it into a database.

Process:

Creat MySQL Database server.
Create a table to hold the toll data. see (configure_lab)
Start the Kafka server.
Install the Kafka python driver.
Install the MySQL python driver.
Create a topic to hold the toll data in kafka.
Configure toll plaza program to stream to created topic. (toll_plaza_device)
Create streaming data consumer. (see data_consumer)
configure the consumer program to write into a MySQL database table.
Refresh tle o confirm data streaming.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
Configure_lab.sh		Configure_lab.sh
README.md		README.md
data_consumer.py		data_consumer.py
toll_plaza_device.py		toll_plaza_device.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ETL-Streaming-Pipelines-Kafka

Data Pipeline architecture:

Description:

Process:

About

Releases

Packages

Languages

wafemi999/ETL-Streaming-Pipelines-Kafka

Folders and files

Latest commit

History

Repository files navigation

ETL-Streaming-Pipelines-Kafka

Data Pipeline architecture:

Description:

Process:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages