Pinned repositories
Repositories
-
backstage
Backstage is an open platform for building developer portals
-
styx
"The path to execution": Styx is a service that schedules batch data processing jobs in Docker containers on Kubernetes.
-
scio
A Scala API for Apache Beam and Google Cloud Dataflow.
-
github-java-client
A Java client for the GitHub API
-
web-scripts
A collection of base configs and CLI wrappers used to speed up development @ Spotify.
-
big-data-rosetta-code
Code snippets for solving common big data problems on various platforms. Inspired by Rosetta Code.
-
featran
A Scala feature transformation library for data science and machine learning
-
magnolify
A collection of Magnolia add-on modules
-
folsom
An asynchronous memcache client for Java
-
dbeam
DBeam exports SQL tables into Avro files using JDBC and Apache Beam
-
zoltar
Common library for serving TensorFlow, XGBoost and scikit-learn models in production.
-
luigi
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization, and more. It also comes with built-in Hadoop support.
-
missinglink
Build-time tool for detecting link problems in Java projects
-
SPTDataLoader
The HTTP library used by the Spotify iOS client
-
ratatool
A tool for data sampling, data generation, and data diffing
