「Table-related commands in STATA」の版間の差分
ナビゲーションに移動
検索に移動
Vaccipedia.admin (トーク | 投稿記録) |
Vaccipedia.admin (トーク | 投稿記録) |
||
(同じ利用者による、間の9版が非表示) | |||
193行目: | 193行目: | ||
|- | |- | ||
!summarize | !summarize | ||
− | |'''bysort sex: summarize | + | |'''bysort sex: summarize factorA''' |
[[file:summarize_factorA_bysort_sex.jpg]] | [[file:summarize_factorA_bysort_sex.jpg]] | ||
|} | |} | ||
206行目: | 206行目: | ||
|- | |- | ||
!tabulate | !tabulate | ||
− | |'''tabulate sex | + | |'''tabulate factorA sex''' |
[[file:tabulate_factorA_sex.jpg]] | [[file:tabulate_factorA_sex.jpg]] | ||
|- | |- | ||
215行目: | 215行目: | ||
|- | |- | ||
!summarize | !summarize | ||
− | |'''bysort factorA: summarize | + | |'''bysort factorA: summarize sex''' |
[[file:summarize_sex_bysort_factorA.jpg]] | [[file:summarize_sex_bysort_factorA.jpg]] | ||
|} | |} | ||
233行目: | 233行目: | ||
|- | |- | ||
!summarize | !summarize | ||
− | |'''bysort disease: summarize | + | |'''bysort disease: summarize data1''' |
[[file:summarize_data1_bysort_disease.jpg]] | [[file:summarize_data1_bysort_disease.jpg]] | ||
|} | |} | ||
242行目: | 242行目: | ||
|- | |- | ||
!rowspan="3"|table | !rowspan="3"|table | ||
− | |[[file:table_sex_factorA_statistic(percent).jpg]] | + | |'''table sex factorA, statistic(percent)''' |
+ | [[file:table_sex_factorA_statistic(percent).jpg]] | ||
|This calculates proportions of cells compared to the whole<br> without showing raw values | |This calculates proportions of cells compared to the whole<br> without showing raw values | ||
|- | |- | ||
− | |[[file:table_sex_factorA_statistic(percent_across(sex)).jpg]] | + | |'''table sex factorA, statistic(percent, across(sex))''' |
+ | [[file:table_sex_factorA_statistic(percent_across(sex)).jpg]] | ||
|This calculates proportions in column (longitudinal) directions<br> without showing raw values | |This calculates proportions in column (longitudinal) directions<br> without showing raw values | ||
|- | |- | ||
− | |[[file:table_sex_factorA_statistic(percent_across(factorA)).jpg]] | + | |'''tale sex factorA, statistic(percent, across(factorA))''' |
+ | [[file:table_sex_factorA_statistic(percent_across(factorA)).jpg]] | ||
|This calculates proportions in row (transverse) directions<br> without showing raw values | |This calculates proportions in row (transverse) directions<br> without showing raw values | ||
|- | |- | ||
!rowspan="2"|tabulate | !rowspan="2"|tabulate | ||
− | |[[file:tabulate_sex_factorA_column.jpg]] | + | |'''tabulate sex factorA, column''' |
+ | [[file:tabulate_sex_factorA_column.jpg]] | ||
|This calculates proportions in column (longitudinal) directions | |This calculates proportions in column (longitudinal) directions | ||
|- | |- | ||
− | |[[file:tabulate_sex_factorA_row.jpg]] | + | |'''tabulate sex factorA, row''' |
+ | [[file:tabulate_sex_factorA_row.jpg]] | ||
|This calculates proportions in row (transverse) directions | |This calculates proportions in row (transverse) directions | ||
|} | |} | ||
264行目: | 269行目: | ||
|- | |- | ||
!rowspan="3"|tabstat | !rowspan="3"|tabstat | ||
− | |[[File:tabstat_factorABC_by(disease).jpg]] | + | |'''tabstat factorA factorB factorC, by(disease)''' |
+ | [[File:tabstat_factorABC_by(disease).jpg]] | ||
| | | | ||
|- | |- | ||
− | |[[File:tabstat_factorABC_by(disease)_statistic(sum).jpg]] | + | |'''tabstat factorA factorB factorC, by(disease) statistic(sum)''' |
+ | [[File:tabstat_factorABC_by(disease)_statistic(sum).jpg]] | ||
|factorA,B,C are binary variables so summations of values provide the positivities of factorA,B,C | |factorA,B,C are binary variables so summations of values provide the positivities of factorA,B,C | ||
|- | |- | ||
− | |[[File:tabstat_factorABC_by(disease)_statistic(n).jpg]] | + | |'''tabstat factorA factorB factorC, by(disease) statistic(n)''' |
+ | [[File:tabstat_factorABC_by(disease)_statistic(n).jpg]] | ||
|''statistic(n)'' (''statistic(count)'' is the same) only counts observations with real values, which only tell non-missing observations | |''statistic(n)'' (''statistic(count)'' is the same) only counts observations with real values, which only tell non-missing observations | ||
|} | |} | ||
278行目: | 286行目: | ||
|- | |- | ||
!rowspan="3"|tabstat | !rowspan="3"|tabstat | ||
− | |[[File:tabstat_factorABC_by(SES).jpg]] | + | |'''tabstat factorA factorB factorC, by(SES)''' |
+ | [[File:tabstat_factorABC_by(SES).jpg]] | ||
| | | | ||
|- | |- | ||
− | |[[File:tabstat_factorABC_by(SES)_statistic(sum).jpg]] | + | |'''tabstat factorA factorB factorC, by(SES) statistic(sum)''' |
+ | [[File:tabstat_factorABC_by(SES)_statistic(sum).jpg]] | ||
|factorA,B,C are binary variables so summations of values provide the positivities of factorA,B,C | |factorA,B,C are binary variables so summations of values provide the positivities of factorA,B,C | ||
|- | |- | ||
− | |[[File:tabstat_factorABC_by(SES)_statistic(n).jpg]] | + | |'''tabstat factorA factorB factorC, by(SES) statistic(n)''' |
+ | [[File:tabstat_factorABC_by(SES)_statistic(n).jpg]] | ||
|''statistic(n)'' (''statistic(count)'' is the same) only counts observations with real values, which only tell non-missing observations | |''statistic(n)'' (''statistic(count)'' is the same) only counts observations with real values, which only tell non-missing observations | ||
|} | |} | ||
293行目: | 304行目: | ||
|- | |- | ||
!tabulate | !tabulate | ||
− | |[[File:tabulate_disease_SES_summarize(data1).jpg]] | + | |'''tabulate disease SES, summarize(data1)''' |
+ | [[File:tabulate_disease_SES_summarize(data1).jpg]] | ||
|This tells means, SDs and frequencies of a continuous variable divided in two-way of binary/categorical variables | |This tells means, SDs and frequencies of a continuous variable divided in two-way of binary/categorical variables | ||
|} | |} |
2023年4月2日 (日) 19:50時点における最新版
目次
Abbreviations of commands
table | (no abbv.) |
---|---|
tabulate | ta tab |
tabstat | (no abbv.) |
summarize | su |
Differences between table, tabulate, tabstat, summarize
one-way | two-way | options | |
---|---|---|---|
table | table v1 create a one-way table of v1 |
table v1 v2 create a two-way table of v1 |
,statistics( ) |
tabulate | tabulate v1 create a one-way table of v1 |
tabulate v1 v2 create a two-way table with v1 |
,chi2 Pearson's chi-squared test; *only for two-way ,summarize(v3) detailed statistics for v3 |
tabstat | tabstat v1 create a one-way table of v1 |
*no two- or multiple-way table | ,statistics( ) ,by(v3) detailed statistics for each of v3 |
summarize | summarize v1 detailed statistics of v1 |
*no two- or multiple-way summary | ,detail |
† row = transverse direction, column = longitudinal direction
Sample data
Suppose we have such a dataset in STATA.
Where,
id | discrete | :Identification number |
---|---|---|
sex | binary | :Male=0, Female=1 |
data1 | continuous | :Results of a certain test |
factorA, B, C | binary | :Negative=0, Positive=1 |
SES | categorical | :Categories of Socio-Economic Status, divided into four |
disease | binary | :Free from a certain disease=0, Having the disease=1 |
One-way
Summary of sex, a binary variable
table | table sex | Both reports frequency but tabulate is more detailed |
---|---|---|
tabulate | tabulate sex | |
tabstat | tabstat sex | Both reports mean but summarize is more detailed |
summarize | summarize sex |
Summary of data1, a continuous variable
table | table data1 | Both reports frequency of each value, which does not make sense |
---|---|---|
tabulate | tabulate data1 | |
tabstat | tabstat data1 | Both reports mean but summarize is more detailed |
summarize | summarize data1 |
Summary of SES, a categorical variable
table | table SES | Both reports frequency but tabulate is more detailed |
---|---|---|
tabulate | tabulate SES | |
tabstat | tabstat SES | Both reports mean but summarize is more detailed |
summarize | summarize SES |
One-way, multiple
table | *Both do not create one-way multiple table | |
---|---|---|
tabulate | ||
tabstat | tabstat sex data1 SES | Reports mean in row (transverse) direction |
summarize | summarize sex data1 SES | Reports more details in column (longitudinal) direction |
Two-way
Summary of factorA based on sex
table | table sex factorA | Both creates the same table but tabulate is better visualized |
---|---|---|
tabulate | tabulate sex factorA | |
tabstat | tabstat factorA, by(sex) | Both reports mean but summarize is more detailed; needs bysort option before the command |
summarize | bysort sex: summarize factorA |
Summary of sex based on factorA
table | table factorA sex | Both creates the same table but tabulate is better visualized |
---|---|---|
tabulate | tabulate factorA sex | |
tabstat | tabstat sex, by(factorA) | Both reports mean but summarize is more detailed; needs bysort option before the command |
summarize | bysort factorA: summarize sex |
Summary of data1 based on disease
table | *Both do not create a meaningful table for continuous variable | |
---|---|---|
tabulate | ||
tabstat | tabstat data, by(disease) | Both reports mean but summarize is more detailed; needs bysort option before the command |
summarize | bysort disease: summarize data1 |
Two-way with proportions
Summary of factorA based on sex with proportions
table | table sex factorA, statistic(percent) | This calculates proportions of cells compared to the whole without showing raw values |
---|---|---|
table sex factorA, statistic(percent, across(sex)) | This calculates proportions in column (longitudinal) directions without showing raw values | |
tale sex factorA, statistic(percent, across(factorA)) | This calculates proportions in row (transverse) directions without showing raw values | |
tabulate | tabulate sex factorA, column | This calculates proportions in column (longitudinal) directions |
tabulate sex factorA, row | This calculates proportions in row (transverse) directions |
Two-way, multiple
Summary of factorA, factorB, factorC based on disease
tabstat | tabstat factorA factorB factorC, by(disease) | |
---|---|---|
tabstat factorA factorB factorC, by(disease) statistic(sum) | factorA,B,C are binary variables so summations of values provide the positivities of factorA,B,C | |
tabstat factorA factorB factorC, by(disease) statistic(n) | statistic(n) (statistic(count) is the same) only counts observations with real values, which only tell non-missing observations |
Summary of factorA, factorB, factorC based on SES
tabstat | tabstat factorA factorB factorC, by(SES) | |
---|---|---|
tabstat factorA factorB factorC, by(SES) statistic(sum) | factorA,B,C are binary variables so summations of values provide the positivities of factorA,B,C | |
tabstat factorA factorB factorC, by(SES) statistic(n) | statistic(n) (statistic(count) is the same) only counts observations with real values, which only tell non-missing observations |
Two-way of binary/categorical plus summary of continuous
Summary of data1 based on disease and SES
tabulate | tabulate disease SES, summarize(data1) | This tells means, SDs and frequencies of a continuous variable divided in two-way of binary/categorical variables |
---|