Popular repositories Loading
-
nlp-datasets
nlp-datasets PublicForked from niderhoff/nlp-datasets
Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)
-
Flight_delay_prediction_web_app
Flight_delay_prediction_web_app PublicForked from bennwei/Flight_delay_prediction_web_app
A big data web application to predict USA airline traffic delay with Python, Flask, Apache Spark, Kafka, MongoDB, ElasticSearch, d3.js, scikit-learn, MLlib and Apache Airflow.
Jupyter Notebook 1
-
big_data_for_chimps
big_data_for_chimps PublicForked from infochimps-labs/big_data_for_chimps
A Seriously Fun guide to Big Data Analytics in Practice
Ruby 1
-
SynapsePySparkWordCount
SynapsePySparkWordCount PublicForked from NubeEra-Samples/SynapsePySparkWordCount
Create Spark Job Defination
Jupyter Notebook 1
-
spark-word2vec
spark-word2vec PublicCome see these bad n'boujee flavors with apache spark and word2vec scala
Scala
-
datasets
datasets PublicForked from pulkitsikri/datasets
Datasets that I generally use for trainings, workshops
Repositories
- deequ Public Forked from awslabs/deequ
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
LambdaML/deequ’s past year of commit activity - SynapsePySparkWordCount Public Forked from NubeEra-Samples/SynapsePySparkWordCount
Create Spark Job Defination
LambdaML/SynapsePySparkWordCount’s past year of commit activity - CheatSheetSeries Public Forked from OWASP/CheatSheetSeries
The OWASP Cheat Sheet Series was created to provide a concise collection of high value information on specific application security topics.
LambdaML/CheatSheetSeries’s past year of commit activity - nd081-c1-provisioning-microsoft-azure-vms-project-starter Public Forked from udacity/nd081-c1-provisioning-microsoft-azure-vms-project-starter
LambdaML/nd081-c1-provisioning-microsoft-azure-vms-project-starter’s past year of commit activity - nd081-c2-Building-and-deploying-cloud-native-applications-from-scratch-project-starter Public Forked from udacity/nd081-c2-Building-and-deploying-cloud-native-applications-from-scratch-project-starter
LambdaML/nd081-c2-Building-and-deploying-cloud-native-applications-from-scratch-project-starter’s past year of commit activity - missing-semester Public Forked from missing-semester/missing-semester
The Missing Semester of Your CS Education 📚
LambdaML/missing-semester’s past year of commit activity - my-mac-os Public Forked from nikitavoloboev/config
List of applications and tools that make my macOS experience even more amazing
LambdaML/my-mac-os’s past year of commit activity