“”

Explore, Visualise, Model. Better insight starts with Stata

Fast. Accurate. Easy to use. Stata is a complete, integrated software package that provides all your data science needs—data manipulation, visualisation, statistics, and automated reporting.

Purchase or Upgrade your Stata

Introducing Stata 19

Purchase or Upgrade your Stata

Buy Stata for business, government, nonprofit, educational, or student use.

System Requirements
OS Windows 10 or macOS 11.0 or newer
Processor Applie Silicon, Intel or AMD processor (Core i3 equivalent or better)
Memory 4GB RAM
Hard Drive 2GB

Data Management

Statistics

Graphics

Why Stata?

Fast. Accurate. Easy to use. Stata is a complete, integrated software package that provides all your data science needs—data manipulation, visualization, statistics, and automated reporting.

  • Master your data
  • Publication-quality graphics
  • Automated reporting
  • Truly reproducible research
  • Real documentation
  • Trusted
  • Continuously Updated
  • Easy to use
  • Easy to grow with
  • Easy to automate
  • Easy to extend
  • Advanced programming
  • Community-contributed features
  • World-class technical support
  • Cross-platform compatible
  • Widely used
Master your Data

Stata's data management features give you complete control.

  • Frames — manage multiple datasets simultaneously
  • Import, export
  • JDBC, ODBC, SQL
  • Sort, match, merge, join, append, create
  • Built-in spreadsheet
  • Unicode
  • Process text or binary data
  • Access data locally or on the web
  • Collect statistics across groups
  • BLOBs—strings that can hold entire documents
  • Billions of observations
  • Hundreds of thousands of variables
  • Survival data, panel data, multilevel data, survey data, discrete choice dats multiple-imputation data, categorical data, time-series data
  • And much more, to support all your data science needs.
Publication Quality Graphics

Stata makes it easy to generate publication-quality, distinctly styled graphs.

You can point and click to create a custom graph. Or you can write scripts to produce hundreds or thousands of graphs in a reproducible manner.

Export graphs to EPS or TIFF for publication, to PNG or SVG for the web, or to PDF for viewing.


With the integrated Graph Editor, you click to change anything about your graph or to add titles, notes, lines, arrows, and text.

 

Automated Reporting

All the tools you need to automate reporting your results.

  • Dynamic Markdown document
  • Create Word documents
  • Create PDF documents
  • Create Excel files
  • Customizable tables
  • Schemes for graphics
  • Word, HTML, PDF, SVG, PNG
Truly Reproducible Research

Many People talk about reproducible research. Stata has been dedicated to it for over 30 years.

We constantly add new features; we have even fundamentally changed language elements. No matter. Stata is the only statistical package with integrated versioning. If you wrote a script to perform an analysis in 1985, that same script will still run and still produce the same results today. Any dataset you created in 1985, you can read today. And the same will be true in 2050. Stata will be able to run anything you do today.
We take reproducibility seriously.

 

PyStata - Python Integration

Invoke Python interactively or embed Python in your Stata code.

Invoke Stata from Python and call Stata code from IPython environments.

Use Stata within Jupyter Notebook.

Seamlessly pass data and results between Stata and Python.

Use Stata analyses from within Python.

Use any Python package within Stata

  • Matplotlib and seaborn for visualization
  • Beautiful Soup and Scrapy for web scraping
  • NumPy and pandas for numerical analysis
  • TensorFlow and scikit-learn for machine learning
  • And much more
Real Documentation

When it comes time to perform your analyses or understand the methods you are using, Stata does not leave you high and dry or ordering books to learn every detail.

Each of our data management features is fully explained and documented and shown in practice on real examples. Each estimator is fully documented and includes several examples on real data, with real discussions of how to interpret the results. The examples give you the data so you can work along in Stata and even extend the analyses. We give you a Quick start for every feature, showing some of the most common uses. Want even more detail? Our Methods and formulas sections provide the specifics of what is being computed, and our References point you to even more information.


Stata is a big package and so has lots of documentation – over 18,000 pages in 35 manuals. But don't worry, type help my topic, and Stata will search its keywords, indexes, and even community-contributed packages to bring you everything you need to know about your topic. Everything is available right within Stata.

Trusted

We don't just program statistical methods, we validate them.

The results you see from a Stata estimator rest on comparisons with other estimators, Monte Carlo simulations of consistency and coverage, and extensive testing by our statisticians. Every Stata we ship has passed a certification suite that includes 4.1 million lines of testing code that produces 5.8 million lines of output. We certify every number and piece of text from those 5.8 million lines of output.

Reliable

For over 35 years, StataCorp has been loyal to its users by expanding the Stata software with new statistical methods and the latest in reporting, data visualization, data manipulation, and the user interface. With our long-standing release history, we are committed to continually providing stable and reliable software to our diverse community of researchers and practitioners.

Continuously Updated

Staying on the most up-to-date version of Stata is now easier than ever.

StataCorp continually develops new features to enhance Stata software, from the latest statistical methods to the best in reporting, data visualization, and user interface. With StataNow™, new features are released throughout the current release until the next major release. These features are prioritized in the development cycle to be available as soon as they are ready so that users can take advantage of them right away.

Easy to Use

Staying on the most up-to-date version of Stata is now easier than ever.

All of Stata's features can be accessed through menus, dialogs, control panels, a Data Editor, a Variables Manager, a Graph Editor, and even an SEM Diagram Builder. You can point and click your way through any analysis.
If you don't want to write commands and scripts, you don't have to.
Even when you are pointing and clicking, you can record all your results and later include them in reports. You can even save the commands created by your actions and reproduce your complete analysis later.

Easy to Grow with

Stata's commands for performing tasks are intuitive and easy to learn. Even better, everything you learn about performing a task can be applied to other tasks. For example, you simply add if gender=="female" to any command to limit your analysis to females in your sample. You simply add vce(robust) to any estimator to obtain standard errors and hypothesis tests that are robust to many common assumptions.


The consistency goes even deeper. What you learn about data management commands often applies to estimation commands, and vice-versa. There is also a full suite of postestimation commands to perform hypothesis tests, form linear and nonlinear combinations, make predictions, form contrasts, and even perform marginal analysis with interaction plots. These commands work the same way after virtually every estimator.


Sequencing commands to read and clean data, then to perform statistical tests and estimation, and finally to report results is at the heart of reproducible research. Stata makes this process accessible to all researchers.

Easy to Automate

Everyone has tasks that they do all the time—create a particular kind of variable, produce a particular table, perform a sequence of statistical steps, compute an RMSE, etc. The possibilities are endless. Stata has thousands of built-in procedures, but you may have tasks that are relatively unique or that you want done in a specific way.


If you have written a script to perform your task on a given dataset, it is easy to transform that script into something that can be used on all your datasets, on any set of variables, and on any set of observations.

Easy to extend

Some of the things you automate may be so useful that you want to share them with colleagues or even make them available to all Stata users. That's also easy. With just a little code, you can turn an automation script into a Stata command. A command that supports standard features that Stata's official commands support. A command that can be used in the same way official commands are used.

Advanced Programming

Stata also includes an advanced programming language—Mata.


Mata has the structures, pointers, and classes that you expect in your programming language and adds direct support for matrix programming.


Though you don't need to program to use Stata, it is comforting to know that a fast and complete programming language is an integral part of Stata. Mata is both an interactive environment for manipulating matrices and a full development environment that can produce compiled and optimized code. It includes special features for processing panel data, performs operations on real or complex matrices, provides complete support for object-oriented programming, and is fully integrated with every aspect of Stata. Stata also has comprehensive Python integration, allowing you to harness all the power of Python directly from your Stata code.


Stata also has PyStata, which provides comprehensive Python integration, allowing you to harness all the power of Python directly from your Stata code and to harness all the power of Stata from your Python code.


Stata even let's you incorporate C, C++, and Java plugins in your Stata programs via a native API for each language. And you can even embed Java code directly in your Stata code!

Community-contributed features

Stata is so programmable that developers and users add new features every day to respond to the growing demands of today's researchers.


With Stata's Internet capabilities, new features and official updates can be installed over the Internet with a single click.

World-class technical support

All registered users of the current release of Stata (Stata 18) are eligible for free technical support. If you have not registered your copy of Stata, please fill out the online registration form.


We have a dedicated staff of expert Stata programmers and statisticians to answer your technical questions. From tricky data management solutions to getting your graph looking just right and from explaining a robust standard error to specifying your multilevel model, we have your answers.

Cross-platform compatible

Stata will run on Windows, Mac, and Linux/Unix computers; however, our licenses are not platform specific. That means if you have a Mac laptop and a Windows desktop, you don't need two separate licenses to run Stata. You can install  your Stata license on any of the supported platforms. Stata datasets, programs, and other data can be shared across platforms  without translation. You can also quickly and easily import datasets from other statistical packages, spreadsheets, and databases.

Widely used

Used by researchers for more than 35 years, Stata provides everything you need for data science—data manipulation, visualization, statistics, and automated reporting.

Select your discipline and see how Stata can work for you.

Features For Data Scientists
Features for Economists
Features for Education
Features for Epidemiologists
Features for Biostatisticians
Features for Medical Researchers
Features for Sociologists
Features for Public Health Professionals

What's New in Stata 19

Take your research further with the newest features in Stata 19.


Stata 19 has something for everyone. Below we list the highlights of this release. Stata 19 is unique because most of the new features can be used by researchers in every discipline.

Machine learning via H2O: Ensemble decision trees

With the new h2oml suite, use machine learning via H2O to uncover insights from data when traditional statistical models fall short. Machine learning methods are often used to solve research and business problems focused on prediction.

Conditional average treatment effects (CATE)

With the new cate command, you can go beyond estimating an overall treatment effect to estimating individualized or group-specific ones that address these types of research questions.

High-dimensional fixed effects (HDFE)

Absorb not just one but multiple high-dimensional categorical variables in your linear and fixed-effects linear models with option absorb() of commands areg and xtreg.

Bayesian variable selection for linear regression

With the new bayesselect command, you can perform Bayesian variable-selection for linear regression. This approach offers intuitive interpretation and stable inference, accounting for model uncertainty.

 

 

Marginal Cox PH models for interval-censored multiple- events data

Use the new stmgintcox command to analyse interval-censored multiple-event data.

Meta-analysis for correlations

The meta suite now supports meta-analysis (MA) of a correlation coefficient. All standard meta-analysis features, such as forest plots and subgroup analysis, are supported. 

Correlated random-effects (CRE) model

Want coefficient estimates of time-invariant covariates in your panel-data model? With xtreg, cre, you can now fit a correlated random-effects model.

Panel-data vector autoregressive (VAR) model

With the new xtvar command, you can now fit a panel-data vector autoregressive (VAR) model to analyze the trajectories of related variables when you observe multiple units or panels over time. 

Bayesian bootstrap and replicate weights

You can use the new bayesboot prefix to perform Bayesian bootstrap of statistics produced by official and community-contributed commands. Bayesian bootstrap can incorporate prior information to obtain more precise parameter estimates.

Control-function linear and probit models

Fit control-function linear and probit models with the new cfregress and cfprobit commands. Control-function models offer a more flexible approach to traditional instrumental-variables (IV) methods by including endogenous variables.

Bayesian quantile regression via asymmetric Laplace likelihood

The new bayes: qreg command fits Bayesian quantile regression. The Bayesian framework provides full posterior distributions for quantile regression coefficients that offer comprehensive inference.

Inference robust to weak instruments

Use the new estat weakrobust command to perform reliable inference on endogenous regressors.  

Structural vector autoregressive (SVAR) models via instrumental variables

With the new ivsvar command, you can use instruments instead of short-run constraints to estimate dynamic causal effects.

Instrumental-variables local-projection IRFs

With the new ivlpirf command, you can account for endogeneity when using local projections to estimate dynamic causal effects.

Mundlak specification test

Use the new estat mundlak postestimation command after xtreg to choose between random-effects (RE), fixed-effects (FE), or correlated random-effects (CRE) models even with cluster–robust, bootstrap, or jackknife standard errors. 

Latent class model-comparison statistics

With the new lcstats ncommand, you can use statistics such as entropy and a variety of information criteria, to help you determine the appropriate number of classes.

Do-file Editor: Autocompletion, templates, and more

The Do-file Editor has the following additions: Autocompletion of variable names, macros, and stored results; Code folding enhancements; Temporary and permanent bookmarks; Templates, tabs & Navigation panel.

Graphics: Bar graph CIs, heat maps, and more

New graphics features: Heat maps (twoway); Range and point plot with capped spikes (twoway); Range and point plot with spikes (twoway); Improved labeling, CIs and Control of groupings for Bar Graphs, Dot Charts & Box Plots; Colors by variable for more graphs.

Tables: Easier tabulations, exporting, and more

Easily create and customize tables with titles, notes, and exporting for tables. The table command is a flexible tool for creating tabulations, tables of summary statistics, tables of regression results, and more.

Stata in French

Stata's menus, dialogs, and the like can now be displayed in French. If your computer language is set to French (fr), Stata will automatically use its French setting.

Introducing StataNow™

New Features released at the speed of Stata. With StataNow, you will always have the latest features.

 

StataNow is a continuous-release version of Stata, offering new features as soon as they are ready and ensuring users always have access to the most recent version of Stata.

Straight from development to you. With StataNow, you always have access to the latest features.

What is StataNow?
How can I get StataNow?
How do I know if I already have StataNow?

System Requirements

OS Windows 10 or macOS 11.0 or newer
Processor Applie Silicon, Intel or AMD processor (Core i3 equivalent or better)
Memory 4GB RAM
Hard Drive 2GB

Stata in Your Research

Used by hundreds of thousands of researchers for more than 35 years, Stata provides everything you need for data science—data manipulation, visualization, statistics, and reproducible reporting.

Select your discipline and see how Stata can work for you.

Used by hundreds of thousands of researchers for more than 35 years, Stata provides everything you need for data science—data manipulation, visualization, statistics, and reproducible reporting.

Select your discipline and see how Stata can work for you.

Features For Data Scientists
Features for Economists
Features for Education
Features for Epidemiologists
Features for Biostatisticians
Features for Medical Researchers
Features for Sociologists
Features for Public Health Professionals

The Bulletin

  • Harnessing the Power of ChatGPT-4.0 in Econometrics and Programming: A Game-Changer for Researchers
    Harnessing the Power of ChatGPT-4.0 in Econometrics and Programming: A Game-Changer for Researchers
  • The Green Economy: Can Sustainable Investments Drive Growth?
    The Green Economy: Can Sustainable Investments Drive Growth?
  • The Interest Rate Rollercoaster: How High Will They Go?
    The Interest Rate Rollercoaster: How High Will They Go?
  • Emerging Market Challenges Amid U.S Dollar Strengths
    Emerging Market Challenges Amid U.S Dollar Strengths
  • Stock Market Extremes in 2024
    Stock Market Extremes in 2024
  • Data Science: A Necessity, Not an Option
    Data Science: A Necessity, Not an Option

Posts

  • Introducing StataNow!
    Introducing StataNow!