This is the code repository for Data Wrangling with Python 3.x [Video], published by Packt. It contains all the supporting project files necessary to work through the video course from start to finish.
You might work in an organization, or run your own business, where data (structured or unstructured) is generated continuously, and you want to develop your skill set so you can move into the field of Data Science. This hands-on guide shows programmers how to process information. In this course, you will gather data, prepare it for analysis, perform simple statistical analyses, create meaningful data visualizations, and more. The course will equip you with the tools and technologies you need to analyze datasets using Python, so that you can confidently enter the field and enhance your skill set. The best part of this course is the takeaway code templates built from a real-life dataset. By the end of the course, you will have an intuitive understanding of the aspects of Python available for data wrangling.
- Effectively pre-process data (structured or unstructured) before doing any analysis on the dataset.
- Retrieve data from different data sources (CSV, JSON, Excel, PDF) and parse it in Python to give it a meaningful shape.
- Learn about the highly optimized data storage systems used in industry.
- Perform statistical analysis using built-in Python libraries.
- Pick up hacks, tips, and techniques that will be invaluable throughout your Data Science career.
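
As a rough illustration of the retrieve, parse, and analyze steps listed above, here is a minimal sketch that uses only the Python standard library. It is not taken from the course material: the sample data, the `load_records` helper, and the `name` and `age` fields are all hypothetical, and in-memory strings stand in for real CSV and JSON files.

```python
import csv
import io
import json
import statistics

# Hypothetical in-memory samples standing in for real CSV and JSON files.
CSV_TEXT = "name,age\nAda,36\nLinus,28\n"
JSON_TEXT = '[{"name": "Grace", "age": 44}]'


def load_records(csv_text, json_text):
    """Parse CSV and JSON text into one list of dicts with integer ages."""
    # csv.DictReader yields one dict per row, keyed by the header line.
    records = list(csv.DictReader(io.StringIO(csv_text)))
    # json.loads returns a list of dicts for a JSON array of objects.
    records.extend(json.loads(json_text))
    # CSV values arrive as strings, so normalize the age field to int.
    for record in records:
        record["age"] = int(record["age"])
    return records


records = load_records(CSV_TEXT, JSON_TEXT)
mean_age = statistics.mean(record["age"] for record in records)
print(f"{len(records)} records, mean age {mean_age}")
```

With real files, the same pattern would apply by passing an open file object to `csv.DictReader` and using `json.load` in place of `json.loads`.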
To fully benefit from the coverage included in this course, you will need:
This course is for Python developers, data analysts, and IT professionals who are keen to explore data analytics/insights to enrich their current personal or professional projects.
A basic understanding of relational databases and SQL would be a bonus. Even seasoned Python developers can benefit from this course, as it focuses on data engineering aspects.
This course has the following software requirements:
Python 3