Grow your team on GitHub
GitHub is home to over 50 million developers working together. Join them to grow your own development teams, manage permissions, and collaborate on projects.
Sign upRepositories
-
spark
Apache Spark - A unified analytics engine for large-scale data processing
-
-
airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
-
incubator-gobblin
Gobblin is a distributed big data integration framework (ingestion, replication, compliance, retention) for batch and streaming systems. Gobblin features integrations with Apache Hadoop, Apache Kafka, Salesforce, S3, MySQL, Google etc.
-
shardingsphere
Distributed database middleware
-
incubator-nuttx
Apache NuttX is a mature, real-time embedded operating system (RTOS)
-
ozone
Scalable, redundant, and distributed object store for Apache Hadoop
-
hudi
Upserts, Deletes And Incremental Processing on Big Data.
-
tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
-
servicecomb-java-chassis
ServiceComb Java Chassis is a Software Development Kit (SDK) for rapid development of microservices in Java, providing service registration, service discovery, dynamic routing, and service management features
-
-
-
pulsar
Apache Pulsar - distributed pub-sub messaging system
-
incubator-sedona
A cluster computing framework for processing large-scale geospatial data
-
cloudstack-primate
Primate - modern role-base progressive UI for Apache CloudStack
-
incubator-superset
Apache Superset is a Data Visualization and Data Exploration Platform
-
-
incubator-ratis
Open source Java implementation for Raft consensus protocol.
-
-
maven-invoker-plugin
Apache Maven Invoker Plugin
-

