Run Python in Apache Storm topologies. Pythonic API, CLI tooling, and a topology DSL.
A scalable, mature and versatile web crawler based on Apache Storm
[PROJECT IS NO LONGER MAINTAINED] Code examples that show how to integrate Apache Kafka 0.8+ with Apache Storm 0.9+ and Apache Spark Streaming 1.1+, while using Apache Avro as the data serialization format.
News crawling with StormCrawler - stores content as WARC
[PROJECT IS NO LONGER MAINTAINED] Wirbelsturm is a Vagrant and Puppet based tool to perform 1-click local and remote deployments, with a focus on big data tech like Kafka.
Fast Advanced Spam Analysis Tool
A curated list of Pulsar tools, integrations and resources.
Battle-tested Apache Storm Multi-Lang implementation for Python
This repository gathers a curated list of resources to learn Hadoop for free.
Docker image packaging for Apache Storm
A framework for building spouts for Apache Storm and a Kafka based spout for dynamically skipping messages to be processed later.
A suite of benchmark applications for distributed data stream processing systems
Apache Pulsar Adapters
Apache Storm cluster on Docker
Storm Debian Packaging with dpkg-buildpackage
Resources for running StormCrawler with Docker services
Process web archives (WARC format) with StormCrawler and index content into Elasticsearch or Solr
A dockerized image of Apache Storm (Zookeeper, Nimbus, Supervisor, UI, Logviewer).
Apache Storm Spout for Redis Streams.
Real time computation system with Apache Storm, Apache Kafka and Google Guice