TFX is an end-to-end platform for deploying production ML pipelines
-
Updated
Nov 5, 2024 - Python
TFX is an end-to-end platform for deploying production ML pipelines
Cloud Dataflow Google-provided templates for solving in-Cloud data tasks
Yet Another UserAgent Analyzer
[DEPRECATED] Kubernetes operator for managing the lifecycle of Apache Flink and Beam applications.
ETL scripts for Bitcoin, Litecoin, Dash, Zcash, Doge, Bitcoin Cash. Available in Google BigQuery https://goo.gl/oY5BCQ
Tools to make weather data accessible and useful.
Kubernetes operator for managing the lifecycle of Apache Flink and Beam applications.
TFRecorder makes it easy to create TensorFlow records (TFRecords) from Pandas DataFrames and CSVs files containing images or structured data.
A collection of tools for extracting FHIR resources and analytics services on top of that data.
Clojure API for a more dynamic Google Dataflow
Collection of transforms for the Apache beam python SDK.
Streaming Ethereum and Bitcoin blockchain data to Google Pub/Sub or Postgres in Kubernetes
Asgarde allows simplifying error handling with Apache Beam Java, with less code, more concise and expressive code.
Repository to quickly get you started with new Machine Learning projects on Google Cloud Platform. More info(slides):
Export a whole BigQuery table to Google Datastore with Apache Beam/Google Dataflow
Microservices in Post-Kubernetes Era. A polyglot monorepo
Some class materials for a data processing course using PySpark
Blockchain ETL Architecture
Opinionated serverless event analytics pipeline
Add a description, image, and links to the apache-beam topic page so that developers can more easily learn about it.
To associate your repository with the apache-beam topic, visit your repo's landing page and select "manage topics."