ingestr is a CLI tool to copy data between any databases with a single command seamlessly.
-
Updated
Nov 25, 2024 - Python
ingestr is a CLI tool to copy data between any databases with a single command seamlessly.
Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & Named Entity Recognition) & data enrichment (annotation) pipelines & ingestor to Solr or Elastic search index & linked data graph database
AzLogDcrIngestPS - Unleashing the power of Log Ingestion API with Azure LogAnalytics custom table v2, Azure Data Collection Rules and Azure Data Ingestion Pipeline
Google Cloud Storage connector, pre-processor and model for predicting user search intent based on keywords
Google Analytics connector, pre-processor and model for predicting churning users for digital publishers.
My experiments with Apache Spark for Humans ⭐
DataStax or Cassandra Ingest from Relational Databases with StreamSets
Sample Azure Data Factory pipeline for ingesting Data Packages directly from the Download API of the Ordnance Survey Data Hub into Azure Storage.
Created a data pipeline using sqoop to ingest data from sql server into the hive table and used hive for feature engineering and analysis.
Ingest any format data into postgreSQL database
A real-life end-to-end cloud sub-system scenario
The multinational retail data contralisation project is a data warehousing project that focuses on ingesting data from disparate sources to create a centralised warehouse
Transform incoming AWS WorkMail email with Excel attachment to CSV and save to S3 bucket
A cryptho currency automated bot
Código fuente: Análisis de Vuelos basado en trabajo de Valliappa Lakshmanan.
ETL process applied on covid-19 dataset of European countries using Azure services such as databricks, keyvault, sql database, data factory etc. Finally power bi dashbaord was also made.
Apache Spark example reading from MSSQL and converting in AVRO format.
Simulating a consultancy project for Repsol, the repository contains both the code notebook and the analysis.
Add a description, image, and links to the ingestion-pipeline topic page so that developers can more easily learn about it.
To associate your repository with the ingestion-pipeline topic, visit your repo's landing page and select "manage topics."