「Table-related commands in STATA」の版間の差分

提供: Vaccipedia | Resources for Vaccines, Tropical medicine and Travel medicine
ナビゲーションに移動 検索に移動
188行目: 188行目:
 
==Two-way, multiple==
 
==Two-way, multiple==
 
===Summary of ''factorA'', ''factorB'', ''factorC'' based on ''disease''===
 
===Summary of ''factorA'', ''factorB'', ''factorC'' based on ''disease''===
 +
{|class="wikitable"
 +
|-
 +
!rowspan="3"|tabstat
 +
|[[File:tabstat_factorABC_by(disease).jpg]]
 +
|
 +
|-
 +
|[[File:tabstat_factorABC_by(disease)_statistic(sum).jpg]]
 +
|factorA,B,C are binary variables so summations of values provide the positivities of factorA,B,C
 +
|-
 +
|[[File:tabstat_factorABC_by(disease)_statistic(n).jpg]]
 +
|''statistic(n)'' (''statistic(count)'' is the same) only counts observations with real values, which only tell non-missing observations
 +
|}

2023年4月2日 (日) 18:15時点における版

Abbreviations of commands

table (no abbv.)
tabulate ta
tab
tabstat (no abbv.)
summarize su

Differences between table, tabulate, tabstat, summarize

one-way two-way options
table
table v1

create a one-way table
of v1 with simple statistics

table v1 v2

create a two-way table
of v1 in row† and v2 in column†

,statistics( )
tabulate
tabulate v1

create a one-way table
of v1 with detailed statistics

tabulate v1 v2

create a two-way table
with v1 in row† and v2 in column†

,chi2

Pearson's chi-squared test; *only for two-way

,summarize(v3)

detailed statistics for v3

tabstat
tabstat v1

create a one-way table of v1
with detailed statistics

*no two- or multiple-way table
,statistics( )
,by(v3)

detailed statistics for each of v3

summarize
summarize v1

detailed statistics of v1

*no two- or multiple-way summary
,detail

† row = transverse direction, column = longitudinal direction

Sample data

Suppose we have such a dataset in STATA.

STATAsample.jpg

One-way

Summary of sex, a binary variable

table Table sex.jpg Both reports frequency
but tabulate is more detailed
tabulate Tabulate sex.jpg
tabstat Tabstat sex.jpg Both reports mean
but summarize is more detailed
summarize Summarize sex.jpg

Summary of data1, a continuous variable

table Table data1.jpg Both reports frequency of each value,
which does not make sense
tabulate Tabulate data1.jpg
tabstat Tabstat data1.jpg Both reports mean
but summarize is more detailed
summarize Summarize data1.jpg

Summary of SES, a categorical variable

table Table SES.jpg Both reports frequency
but tabulate is more detailed
tabulate Tabulate SES.jpg
tabstat Tabstat SES.jpg Both reports mean
but summarize is more detailed
summarize Summarize SES.jpg

One-way, multiple

table *Both do not create one-way multiple table
tabulate
tabstat Tabstat sex data1 SES.jpg Reports mean in row (transverse) direction
summarize Summarize sex data1 SES.jpg Reports more details in column (longitudinal) direction

Two-way

Summary of factorA based on sex

table Table sex factorA.jpg Both creates the same table
but tabulate is better visualized
tabulate Tabulate sex factorA.jpg
tabstat Tabstat factorA by(sex).jpg Both reports mean
but summarize is more detailed;
needs bysort option before the command
summarize Summarize factorA bysort sex.jpg

Summary of sex based on factorA

table Table factorA sex.jpg Both creates the same table
but tabulate is better visualized
tabulate Tabulate factorA sex.jpg
tabstat Tabstat sex by(factorA).jpg Both reports mean
but summarize is more detailed;
needs bysort option before the command
summarize Summarize sex bysort factorA.jpg

Summary of data1 based on disease

table *Both do not create a meaningful table for continuous variable
tabulate
tabstat Tabstat data1 by(disease).jpg Both reports mean
but summarize is more detailed;
needs bysort option before the command
summarize Summarize data1 bysort disease.jpg

Two-way, multiple

Summary of factorA, factorB, factorC based on disease

tabstat Tabstat factorABC by(disease).jpg
Tabstat factorABC by(disease) statistic(sum).jpg factorA,B,C are binary variables so summations of values provide the positivities of factorA,B,C
Tabstat factorABC by(disease) statistic(n).jpg statistic(n) (statistic(count) is the same) only counts observations with real values, which only tell non-missing observations