📁 Extract, Transform, Load (ETL) 👷 refers to a process in database usage and especially in data warehousing. This repository contains a starter kit featuring ETL related work.
-
Updated
Mar 20, 2017 - Scala
pagerank hadoop
-
Updated
Aug 20, 2017 - Java
MapReduce in Nodejs
-
Updated
Mar 15, 2017 - JavaScript
Lightweight and extensible library to execute MapReduce-like jobs in Python
-
Updated
Jan 2, 2025 - Python
MapReduce Job Development, RDDs Programming, Medical Data Management, Sales Analysis, And Efficient Data Integration For Big Data Analysis. Spark: Big Data Processing, SQOOP Integration, And Spark Structured Streaming For Real-Time Data.
-
Updated
Jun 7, 2023 - Java
Map-Reduce jobs in python to get insightful information from NYC Taxi data
-
Updated
May 4, 2018 - Python
MapReduce Framework based on Storm that is flexible for any MapReduce work. Built with a number of workers and a single master.Used BerkeleyDB as temporary data storage in case of big data processing
-
Updated
Aug 11, 2017 - HTML
Recommends movies to the users based on the users profiles and the ratings of other users.
-
Updated
Feb 27, 2017 - Java
Mapreduce concepts- Secondary sort, counters, mutiple mapreduce jobs
-
Updated
Sep 16, 2017 - Java
Performed business operations using Big data technologies: AWS EMR, AWS RDS (MySQL), Hadoop, Apache Scoop, Apache HBase, MapReduce
-
Updated
Sep 20, 2023 - Python
Hadoop jobs written using GoLang, and run using Hadoop on Docker Containers
-
Updated
May 2, 2018 - Shell
Beta versions/student projects
-
Updated
Jun 19, 2017 - CSS
Cloud and big data 2017/2018: Programming Assignments
-
Updated
Jan 10, 2018 - Python
-
Updated
Apr 13, 2017 - Java
Big Data, Hadoop, and MapReduce in Python. MapReduce Jobs using the MRJob library & Amazon Elastic MapReduce service.
-
Updated
Jan 1, 2023 - Jupyter Notebook
A cloud computing coursework on bigdata etc
-
Updated
Apr 16, 2018 - Python
Big data technologies that I have experimented with
-
Updated
Aug 30, 2017 - Python
Design and implementation of different MapReduce jobs used to analyze a dataset on Covid-19 disease created by Our World In Data
-
Updated
Jun 26, 2023 - Java
-
Updated
Apr 13, 2017 - Java
Count the number of times a word occurs in 1GB (Big Data) Dataset of books using hadoop map-reduce
-
Updated
Apr 20, 2017 - Java