Currently: Bash; Python. Useful code for common issues.
Language | Tool/Library | Link | Description |
---|---|---|---|
Bash | File Count | Returns the count of files in a directory | |
Bash | Convert To Numbered List | Orders all files in a directory from 1 to N | |
Bash | Zero Pad | Zero Pads files for sorting in Bash | |
Python | Pandas | Distinction Between Two Dataframes | Various Methods for determining simliarities and differences of dataframes |
Python | Pandas | Common ETL | Commonly used code for ETL |
Python | Pandas/Itertools | Combine Columns | Create new columns by joining data from others |
Python | Numpy | Useful Analysis | Various sums, distances, and the sigmoid functions |
Python | Numpy | Quicksort | Quicksort of two arrays A0 & A1 |
Python | Threading | Multithreading | Multithread to speed up your processing time |
Python | Sklearn | Decision Trees | Uses Decsion Tree Model, returns Mean Absolute Error |
Python | Sklearn | Random Forest | Uses Random Forest Model, returns Mean Absolute Error |
Python | Sklearn | Data Imputation | Sklearn method that uses various metrics for data imputation |
Python | Sklearn | Categorical Encoding | Label Encoding and One-Hot Encoding examples |
Python | Sklearn | Pipeline | Design a pipeline to combine multiple steps |
Python | XGBoost | XGBoost | Extreme Gradient Boosting for high performance predictions |
Python | Multiprocessing | Multiprocessing | Enhance performance by using multiple processes in parallel. |
Python | LightGBM | Light GBM | Gradient boosting framework that uses tree based learning algorithm |
Python | Categorical Encoding | Category Encoding | Useful python package for various tools to encode data |
Python | BigQuery | Perform a Query | Sample implementation of BigQuery from import to query |
Python | OS | Remove Small Images | Open all images in a folder and remove ones that are too small. |