Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.
-
Updated
Jan 2, 2024 - Python
Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.
Framework for correlating two or more well logs using feature vectors generated from CNN's in Pytorch
Joblib-like interface for parallel GPU computations (e.g. data preprocessing)
A machine learning exercise using the Spotify "hit predictor" dataset, with data analysis of past "hits" by decade. Deployment using Flask via Heroku.
The aim of this project is to develop a solution using Data science and machine learning to predict the compressive strength of a concrete with respect to the its age and the quantity of ingredients used.
Spam SMS Detection Project implemented using NLP & Transformers. DistilBERT - a hugging face Transformer model for text classification is used to fine-tune to best suit data to achieve the best results. Multinomial Naive Bayes achieved an F1 score of 0.94, the model was deployed on the Flask server. Application deployed in Google Cloud Platform
A GitHub WebCrawler
PyPOLAR is a Python-based app for analyzing polarization-resolved microscopy data to measure molecular orientation and order in biological samples
A Regression Model that predicts a fish's weight based on its specie, length, width & height.
🎃Kaggle-Comptetion-Titanic-Dataset(Codeperfectplus)🏆
predict the winning horse with supervised machine learning models (lucky to have 100% accuracy on small test data)
A bot designed to answer live trivia game questions.
RandomForest Regressor Model ML for predicting Price of House.
An IA model that detects whether a given verse is from the Bible or not
Machine Learning Project for recommendations of music genre based on age and gender
A step-by-step guide to master various aspects of Joblib for parallel computing in Python
magic-wormholing pickled objects made simple
Python scripts to download, process, and analyze the New York City Taxi and Limousine Commission (TLC) Trip Record Data dataset
📊 30 Days of Data Science is a daily challenge to guide you through Data Science essentials. From basics to advanced, this repo offers clear examples, practical exercises, and resources to help you master Data Science, one day at a time. Whether you're new or refining your skills, this challenge has something for you. Join the journey now! 🚀
Add a description, image, and links to the joblib topic page so that developers can more easily learn about it.
To associate your repository with the joblib topic, visit your repo's landing page and select "manage topics."