GitHub - shubhangigupta283/distributed-database-systems: CSE 512

Distributed Database Systems

1. Database Sharding

Simulate data partitioning approaches on top of an open-source relational database management system (PostgreSQL) and build a simplified query processor that accesses data from the generated partitions. Functions implemented -

rangePartition(), roundRobinPartition()
roundRobinInsert(), rangeInsert()
rangeQuery(), pointQuery()
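
As a rough illustration of the two partitioning schemes named above, here is a minimal in-memory sketch. It does not touch PostgreSQL: plain Python lists stand in for partition tables. The function names, the `(id, value)` row layout, the default value range of 0 to 5, and the boundary convention (the first partition includes its lower bound, later partitions exclude theirs) are all assumptions made for the example, not details taken from the repository.

```python
import math


def range_partition_index(value, num_partitions, lo=0.0, hi=5.0):
    """Pick a partition under uniform range partitioning (assumed convention).

    Partition 0 covers [lo, lo + width]; partition i > 0 covers
    (lo + i * width, lo + (i + 1) * width].
    """
    if not lo <= value <= hi:
        raise ValueError(f"value {value} outside [{lo}, {hi}]")
    if value == lo:
        return 0
    width = (hi - lo) / num_partitions
    # Clamp to guard against floating-point overshoot at the upper bound.
    return min(math.ceil((value - lo) / width) - 1, num_partitions - 1)


def range_partition(rows, num_partitions, key=lambda row: row[1]):
    """Split rows into num_partitions lists by the range of key(row)."""
    partitions = [[] for _ in range(num_partitions)]
    for row in rows:
        partitions[range_partition_index(key(row), num_partitions)].append(row)
    return partitions


def round_robin_partition(rows, num_partitions):
    """Assign the i-th row to partition i mod num_partitions."""
    partitions = [[] for _ in range(num_partitions)]
    for i, row in enumerate(rows):
        partitions[i % num_partitions].append(row)
    return partitions


def round_robin_insert(partitions, row):
    """Append row to the partition that is next in round-robin order."""
    total_rows = sum(len(part) for part in partitions)
    partitions[total_rows % len(partitions)].append(row)


def range_query(partitions, low, high, key=lambda row: row[1]):
    """Return rows with low <= key(row) <= high, scanning every partition."""
    return [row for part in partitions for row in part if low <= key(row) <= high]
```

With range partitioning, a smarter query processor could skip partitions whose range cannot overlap `[low, high]`; the sketch scans everything to stay short.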

2. Multithreaded Sort and Join

Implement generic parallel sort and parallel join algorithms for an RDBMS. Functions implemented -

parallelSort(), parallelJoin()
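
One common way to structure a parallel sort and a parallel join is sketched below, under stated assumptions: rows are Python tuples held in memory, sort keys are numeric, the sort uses range partitioning, and the join uses hash partitioning with a per-partition hash join. The names `parallel_sort`, `parallel_join`, and `_hash_join` are illustrative and may differ from how the repository's functions work.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor


def parallel_sort(rows, key, num_workers=4):
    """Range-partition rows on a numeric key, sort each bucket in a thread,
    then concatenate the buckets in range order."""
    if not rows:
        return []
    keys = [key(r) for r in rows]
    lo, hi = min(keys), max(keys)
    width = (hi - lo) / num_workers or 1  # avoid zero width when all keys are equal
    buckets = [[] for _ in range(num_workers)]
    for r in rows:
        idx = min(int((key(r) - lo) / width), num_workers - 1)
        buckets[idx].append(r)
    with ThreadPoolExecutor(max_workers=num_workers) as pool:
        sorted_buckets = list(pool.map(lambda b: sorted(b, key=key), buckets))
    return [r for bucket in sorted_buckets for r in bucket]


def _hash_join(left, right, left_key, right_key):
    """Join two partitions: build a hash index on left, probe it with right."""
    index = defaultdict(list)
    for l in left:
        index[left_key(l)].append(l)
    return [l + r for r in right for l in index.get(right_key(r), [])]


def parallel_join(left, right, left_key, right_key, num_workers=4):
    """Hash-partition both inputs on the join key and join matching
    partitions in separate threads. Output order is not specified."""
    lparts = [[] for _ in range(num_workers)]
    rparts = [[] for _ in range(num_workers)]
    for l in left:
        lparts[hash(left_key(l)) % num_workers].append(l)
    for r in right:
        rparts[hash(right_key(r)) % num_workers].append(r)
    with ThreadPoolExecutor(max_workers=num_workers) as pool:
        results = list(pool.map(
            lambda pair: _hash_join(pair[0], pair[1], left_key, right_key),
            zip(lparts, rparts),
        ))
    return [row for part in results for row in part]
```

In CPython, threads mainly illustrate the structure here; for CPU-bound work the real speedup would come from handing each partition to the database or to separate processes.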

3. Map Reduce

Write a MapReduce program using the Hadoop MapReduce framework that performs an equijoin on input from a file.
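
The map, shuffle, and reduce steps of an equijoin can be sketched without Hadoop, as below. The input format is an assumption: each line is taken to be comma-separated, with the table name in the first field and the join key in the second. The function names are hypothetical and the sketch runs in a single process, so it shows only the data flow, not a Hadoop job.

```python
from collections import defaultdict


def map_phase(line, key_index=1):
    """Emit (join_key, record) for one input line."""
    fields = [f.strip() for f in line.split(",")]
    return fields[key_index], fields


def reduce_phase(records):
    """Pair up records that share a join key but come from different tables."""
    joined = []
    for i, a in enumerate(records):
        for b in records[i + 1:]:
            if a[0] != b[0]:  # field 0 is the assumed table name
                joined.append(", ".join(a + b))
    return joined


def equijoin(lines):
    """Run map, group by key (the shuffle step), then reduce each group."""
    groups = defaultdict(list)
    for line in lines:
        if line.strip():
            key, record = map_phase(line)
            groups[key].append(record)
    output = []
    for records in groups.values():
        output.extend(reduce_phase(records))
    return output
```

In Hadoop the grouping step is done by the framework between the mapper and the reducer; the dictionary here only imitates it.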

4. Hotspot Analysis

Analyze NYC taxi trip data on Apache Spark running on HDFS.
Perform geospatial data analysis using Spark SQL: point query, range query, hot zone analysis, and hot cell analysis.
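
To make the range query and hot zone ideas concrete, here is a small sketch in plain Python rather than Spark SQL. It assumes that a rectangle is given as `(x1, y1, x2, y2)`, that points on the boundary count as contained, and that hot zone analysis means counting the points that fall inside each rectangle. The names `st_contains`, `range_query`, and `hot_zones` are chosen for the example; hot cell analysis is not covered.

```python
def st_contains(rect, point):
    """Return True if point (x, y) lies inside or on the border of rect."""
    x1, y1, x2, y2 = rect
    px, py = point
    return (min(x1, x2) <= px <= max(x1, x2)
            and min(y1, y2) <= py <= max(y1, y2))


def range_query(rect, points):
    """Return every point that falls inside the query rectangle."""
    return [p for p in points if st_contains(rect, p)]


def hot_zones(rects, points):
    """Count points per rectangle, dropping empty rectangles, sorted by rectangle."""
    counts = [(r, sum(st_contains(r, p) for p in points)) for r in rects]
    return sorted((rc for rc in counts if rc[1] > 0), key=lambda rc: rc[0])
```

On a cluster, the same logic would typically be expressed as a join between a rectangle table and a point table with the containment test as the join condition, followed by a group-by count.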

5. NoSQL database

Perform textual and spatial searches on MongoDB. Functions implemented -

FindBusinessBasedOnCity(cityToSearch, saveLocation1, collection)
FindBusinessBasedOnLocation(categoriesToSearch, myLocation, maxDistance, saveLocation2, collection)
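
The two searches can be sketched against an in-memory list of dictionaries standing in for the MongoDB collection. Everything specific here is an assumption for illustration: the document fields (`name`, `city`, `categories`, `latitude`, `longitude`), the case-insensitive city match, the haversine distance in miles with an Earth radius of 3959, and the choice to return names instead of writing them to the save location.

```python
import math

EARTH_RADIUS_MILES = 3959  # assumed unit and radius


def haversine_miles(lat1, lon1, lat2, lon2):
    """Great-circle distance between two (lat, lon) points in miles."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = math.radians(lat2 - lat1)
    dlam = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2)
    return EARTH_RADIUS_MILES * 2 * math.atan2(math.sqrt(a), math.sqrt(1 - a))


def find_business_based_on_city(city_to_search, collection):
    """Return names of businesses whose city matches, ignoring case."""
    wanted = city_to_search.lower()
    return [doc["name"] for doc in collection if doc["city"].lower() == wanted]


def find_business_based_on_location(categories_to_search, my_location,
                                    max_distance, collection):
    """Return names of businesses that share at least one category with
    categories_to_search and lie within max_distance miles of my_location."""
    lat, lon = my_location
    wanted = set(categories_to_search)
    return [
        doc["name"]
        for doc in collection
        if wanted & set(doc["categories"])
        and haversine_miles(lat, lon, doc["latitude"], doc["longitude"]) <= max_distance
    ]
```

Against a real MongoDB collection, the city filter would usually be pushed into the query itself, and a geospatial index could replace the per-document distance computation.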

Assignment1
Assignment2
Assignment3
Assignment4/Hotspot-Analysis-Apache-Sedona-Template-master
Assignment5
README.md
