M.Tech · Distributed Systems · 2026

Distributed Data Management

An interactive showcase of data parsing, distributed databases, exploratory analysis pipelines, and federated data architecture.

4 Assignments
12+ Experiments
5+ Technologies

What I Learned

A deep dive into managing data at scale — from raw ingestion to distributed intelligence.

Data Engineering

Mastering heterogeneous data ingestion — CSV, JSON, XML, binary formats, HTML scraping, and SQL/NoSQL CRUD operations with Python.

Pandas · SQLite · MongoDB
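
As a rough illustration of the ingestion and CRUD workflow described above, here is a minimal sketch that parses CSV and JSON text into one record shape and runs create, read, update, and delete against an in-memory SQLite table. It uses only the Python standard library so it stays self-contained. The sample data, the `readings` table, and the function names are hypothetical and not taken from the assignments.

```python
import csv
import io
import json
import sqlite3

# Hypothetical sample inputs; station names and values are made up.
CSV_TEXT = "station,level_m\nA,2.5\nB,3.1\n"
JSON_TEXT = '[{"station": "C", "level_m": 1.8}]'


def parse_csv(text):
    """Parse CSV text into a list of (station, level_m) tuples."""
    reader = csv.DictReader(io.StringIO(text))
    return [(row["station"], float(row["level_m"])) for row in reader]


def parse_json(text):
    """Parse a JSON array of objects into the same tuple shape."""
    return [(rec["station"], float(rec["level_m"])) for rec in json.loads(text)]


def load_and_update(rows):
    """Run one create, update, delete, and read against an in-memory table."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE readings (station TEXT PRIMARY KEY, level_m REAL)")
    # Create: insert every parsed row.
    conn.executemany("INSERT INTO readings VALUES (?, ?)", rows)
    # Update: correct one reading.
    conn.execute("UPDATE readings SET level_m = ? WHERE station = ?", (2.75, "A"))
    # Delete: drop one station.
    conn.execute("DELETE FROM readings WHERE station = ?", ("B",))
    # Read: fetch what is left, in a stable order.
    result = conn.execute(
        "SELECT station, level_m FROM readings ORDER BY station"
    ).fetchall()
    conn.close()
    return result


rows = parse_csv(CSV_TEXT) + parse_json(JSON_TEXT)
final_rows = load_and_update(rows)
print(final_rows)
```

The same record shape could be loaded into a Pandas DataFrame or a MongoDB collection; SQLite is used here only because it ships with Python.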

Federated Architecture

Understanding distributed query processing, schema heterogeneity, mediator-wrapper patterns, and CAP theorem tradeoffs.

2PC Protocol · MVCC · Saga Pattern
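
To make the 2PC tag concrete, the following is a simplified sketch of two-phase commit: a coordinator collects votes in a prepare phase and commits only if every participant votes yes. The `Participant` class and `two_phase_commit` function are illustrative names, and the sketch deliberately leaves out timeouts, durable logging, and crash recovery, which a real implementation would need.

```python
class Participant:
    """A toy resource manager that votes in phase 1 and obeys in phase 2."""

    def __init__(self, name, can_commit=True):
        self.name = name
        self.can_commit = can_commit  # Assumption: the vote is fixed up front.
        self.state = "INIT"

    def prepare(self):
        # Vote yes (PREPARED) or no (ABORTED) on the pending transaction.
        self.state = "PREPARED" if self.can_commit else "ABORTED"
        return self.can_commit

    def commit(self):
        self.state = "COMMITTED"

    def abort(self):
        self.state = "ABORTED"


def two_phase_commit(participants):
    """Return the global decision after running both phases."""
    # Phase 1 (voting): ask every participant to prepare.
    votes = [p.prepare() for p in participants]

    # Phase 2 (decision): commit only on a unanimous yes, otherwise abort.
    if all(votes):
        for p in participants:
            p.commit()
        return "COMMITTED"
    for p in participants:
        p.abort()
    return "ABORTED"


healthy = [Participant("db1"), Participant("db2")]
outcome_ok = two_phase_commit(healthy)

mixed = [Participant("db1"), Participant("db2", can_commit=False)]
outcome_mixed = two_phase_commit(mixed)
print(outcome_ok, outcome_mixed)
```

A single "no" vote aborts the transaction everywhere, which is what gives 2PC its atomicity and also why a slow or failed participant can block the others.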

Exploratory Analysis

Pre-processing pipelines with feature engineering, outlier detection, and correlation heatmaps on Indian water resource data.

NumPy · Seaborn · Sklearn
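
The page does not say which outlier detection method was used. As one common option, the sketch below applies the interquartile range (IQR) rule with the standard library. The `iqr_outliers` helper, the `k=1.5` multiplier, and the sample values are assumptions for illustration and are not drawn from the water resource dataset.

```python
from statistics import quantiles


def iqr_outliers(values, k=1.5):
    """Return the values lying more than k * IQR outside the quartiles."""
    # quantiles(..., n=4) returns the three cut points Q1, Q2, Q3.
    q1, _, q3 = quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < low or v > high]


# Hypothetical readings with one obviously extreme value.
levels = [10, 12, 11, 13, 12, 11, 95]
flagged = iqr_outliers(levels)
print(flagged)
```

The rule is based on quartiles, so it is less sensitive to the extreme values it is trying to find than a mean and standard deviation threshold would be.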

Machine Learning

Applying Linear Regression for predictive modeling with StandardScaler, train-test splits, and performance metrics (MSE, R²).

LinearRegression · R² Score · MSE
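
The steps named above (scaling, a train-test split, a linear fit, then MSE and R²) can be sketched without external dependencies. This is a pure-Python stand-in for what scikit-learn's `StandardScaler`, `LinearRegression`, and metric functions do, limited to a single feature. The synthetic data, the fixed 8/2 split, and all helper names are assumptions for illustration.

```python
from statistics import fmean, pstdev


def standardize(values, mean, std):
    """Scale values to zero mean and unit variance using the given statistics."""
    return [(v - mean) / std for v in values]


def fit_line(x, y):
    """Ordinary least squares for one feature: returns (slope, intercept)."""
    mx, my = fmean(x), fmean(y)
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, my - slope * mx


def mse(y_true, y_pred):
    """Mean squared error."""
    return fmean([(t - p) ** 2 for t, p in zip(y_true, y_pred)])


def r2_score(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_true = fmean(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1 - ss_res / ss_tot


# Synthetic, noise-free data following y = 2x + 1.
x = [float(i) for i in range(10)]
y = [2 * xi + 1 for xi in x]

# Deterministic split: first 8 points train, last 2 test.
x_train, x_test = x[:8], x[8:]
y_train, y_test = y[:8], y[8:]

# Fit the scaler on the training data only, then apply it to both sets.
mean, std = fmean(x_train), pstdev(x_train)
x_train_s = standardize(x_train, mean, std)
x_test_s = standardize(x_test, mean, std)

slope, intercept = fit_line(x_train_s, y_train)
y_pred = [slope * xi + intercept for xi in x_test_s]

test_mse = mse(y_test, y_pred)
test_r2 = r2_score(y_test, y_pred)
print(test_mse, test_r2)
```

Computing the scaling statistics from the training split alone avoids leaking information from the test set into the model.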

Assignments Showcase

Click on an assignment card to explore the full report, code, and experimental outputs.

Tools & Technologies

Python
Pandas
SQLite
MongoDB
Scikit-learn
LaTeX
NumPy
Seaborn
Hadoop
MapReduce
AQUA-Fed