Stars
🔥🕷️ Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper
PredictionIO, a machine learning server for developers and ML engineers.
Apache Druid: a high performance real-time analytics database.
akarray / avro-json
Forked from jwills/avro-json
Utilities for converting to and from JSON from Avro records via Hadoop streaming or Hive.
Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.