PyStata—Python and Stata

Highlights

Use Stata from within Python
Stata API functions to run Stata commands and access Stata data and returned results from Python
IPython magic command to use Stata from Jupyter Notebook

PyStata allows you to invoke Stata directly from any standalone Python environment and to call Python directly from Stata, greatly expanding Stata's Python integration features.

Features in PyStata include

the ability to use Stata from an IPython kernel-based environment like Jupyter Notebook, Spyder IDE, or PyCharm IDE;
the ability to use Stata from the Python shell in a command-line environment such as the Windows Command Prompt, the macOS terminal, or the Unix terminal;
four IPython magic commands: stata, mata, pystata, and, in StataNow™, help;
a suite of API functions from within Python to run Stata commands and access Stata data and returned results.

These tools, together with the Stata Function Interface (sfi) module, allow users to easily integrate Stata's vast statistical and data management methods into any data science project using Python.

Let's see it work

Imagine that a health provider is interested in studying the effect of a new hospital admissions procedure on patient satisfaction. They have monthly data on patients before and after the new procedure was implemented in some of their hospitals. The data are in nested JSON format, and the health provider uses Python as the data analysis tool. But they would like to use Stata's new DID regression to analyze the effect of the new admissions procedure on the hospitals that participated in the program. The outcome of interest is patient satisfaction, satisfaction_score, and the treatment variable is procedure.

A portion of did.json is

{
    "hospital_id": "1",
    "month": "7",
    "records": [
        {
            "procedure": "New",
            "satisfaction_score": "4.1065269"
        }
    ]
}
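Because the measurements sit inside a nested records list, each record has to be paired with its parent hospital_id and month before estimation. The sketch below does this with only the standard library, on a single made-up entry shaped like the excerpt above. It also converts the string values to numbers. The helper flatten is hypothetical, and coding procedure as 1 for "New" and 0 otherwise is an assumption, because the excerpt shows only the value "New".

```python
import json

# A stand-in for did.json: one entry shaped like the excerpt above.
raw = '''
[
    {
        "hospital_id": "1",
        "month": "7",
        "records": [
            {"procedure": "New", "satisfaction_score": "4.1065269"}
        ]
    }
]
'''

def flatten(entries):
    """Return one flat dict per record, carrying the parent's keys along."""
    rows = []
    for entry in entries:
        for record in entry["records"]:
            rows.append({
                "hospital_id": int(entry["hospital_id"]),
                "month": int(entry["month"]),
                # Assumed coding: 1 if the new procedure applies, else 0.
                "procedure": 1 if record["procedure"] == "New" else 0,
                "satisfaction_score": float(record["satisfaction_score"]),
            })
    return rows

rows = flatten(json.loads(raw))
print(rows[0])
```

Each resulting row is one hospital-month observation with numeric values, which is the shape a panel estimator expects.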

We use the API functions in a Python script, did.py, to interact with Stata. Some highlights of the code are

# Set up Stata from within Python
import stata_setup
stata_setup.config("C:/Program Files/Stata18", "se")

# Import the JSON file into a pandas DataFrame
import json
from pandas import json_normalize
with open("did.json") as json_file:
    data = json.load(json_file)
data = json_normalize(data, 'records', ['hospital_id', 'month'])

# Load the DataFrame into Stata, even if unsaved data are in memory
from pystata import stata
stata.pdataframe_to_data(data, force=True)

# Run Stata commands in Python
stata.run('''
        didregress (satisfaction_score) (procedure), ///
                group(hospital_id) time(month)
        ''', echo=True)

# Load Stata saved results to Python
r = stata.get_return()['r(table)']

# Use them in Python
print("\n")
print("The treatment hospitals had a %5.2f-point increase." % (r[0][0]), end=" ")
print("The result is with 95%% confidence interval [%5.2f, %5.2f]." % (r[4][0], r[5][0]))

# Generate Stata graph in Python
stata.run("estat trendplots", echo=True)
stata.run("graph export did.svg, replace", quietly=True)
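The indices r[0][0], r[4][0], and r[5][0] in the script follow the layout of r(table), whose rows hold, in order, the coefficient, standard error, test statistic, p-value, and the lower and upper confidence limits, with one column per coefficient. The sketch below names those rows on a stand-in table so that it runs without Stata. The numbers are invented for illustration and are not results from this analysis, and the helper summarize is hypothetical.

```python
# Row positions in Stata's r(table) that the script relies on.
ROW_B, ROW_SE, ROW_STAT, ROW_P, ROW_LL, ROW_UL = range(6)

def summarize(table, col=0):
    """Format the estimate and confidence interval for one coefficient."""
    b = table[ROW_B][col]
    ll = table[ROW_LL][col]
    ul = table[ROW_UL][col]
    return "%5.2f [%5.2f, %5.2f]" % (b, ll, ul)

# Invented values standing in for stata.get_return()['r(table)'].
fake_table = [
    [0.50],   # b
    [0.10],   # se
    [5.00],   # test statistic
    [0.001],  # p-value
    [0.30],   # lower confidence limit
    [0.70],   # upper confidence limit
]

summary = summarize(fake_table)
print(summary)
```

Naming the rows once avoids scattering bare indices such as 4 and 5 through the reporting code.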

Run did.py in Spyder

Here we run did.py, which was created in the above section, in Spyder.

The entire analysis is performed without leaving the Python environment, and with Stata's API functions, data and results flow seamlessly between Python and Stata.

Run did.py in Python Shell

The script can also be executed with Python from any command-line shell, such as the Windows Command Prompt, the macOS terminal, or the Unix terminal. This method uses only the shell environment and does not invoke any GUI element of Stata.

python did.py > did.log

produces a log file, did.log, with output from didregress.

This method is useful for automating tasks in Windows. For example, the above script can be incorporated into a regularly scheduled task to handle new data.
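A scheduler typically needs a single command that runs the script and keeps its output. As a minimal sketch, the wrapper below does from Python what the shell redirection above does, writing a script's output to a log file. Unlike the plain redirection, it also captures error messages. To keep it runnable without Stata, it runs a throwaway stand-in script instead of did.py; run_and_log and the file names are hypothetical.

```python
import subprocess
import sys
import tempfile
from pathlib import Path

def run_and_log(script, log):
    """Run a Python script, write stdout and stderr to log, return the exit code."""
    with open(log, "w") as log_file:
        result = subprocess.run(
            [sys.executable, str(script)],
            stdout=log_file,
            stderr=subprocess.STDOUT,
        )
    return result.returncode

# Stand-in for did.py so the sketch runs without Stata installed.
workdir = Path(tempfile.mkdtemp())
script = workdir / "stand_in.py"
script.write_text('print("analysis output")\n')
log = workdir / "stand_in.log"

exit_code = run_and_log(script, log)
print(exit_code, log.read_text().strip())
```

Returning the exit code lets the scheduled task detect a failed run instead of silently leaving an incomplete log.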

For a detailed example using Stata in Jupyter Notebook or any Python environment that supports IPython, see Jupyter Notebook with Stata.

Tell me more

Learn more about using Python and Stata together.
