差分

ナビゲーションに移動 検索に移動
2,921 バイト追加 、 2023年9月10日 (日) 13:19
編集の要約なし
{{Epi StatFloating_Menu}}
==Standard deviation and Variance==
Find out the relevant coefficient <math>k</math> of <math>\bar{x}\ \pm\ k \frac{s}{\sqrt{n}}</math> from t-distribution table by using desired CI range and degrees of freedom.
 
==Probability, Likelihood==
*[https://sebastianraschka.com/faq/docs/probability-vs-likelihood.html What is the difference between likelihood and probability?]
*[https://rikei-logistics.com/likelihood 確率と尤度の違いは何?]
 
Statistics is an attempt to estimate population through sample.
 
A population always follows some kind of distribution, i.e., follows some kind of probability distribution. But parameters of a population &ndash; e.g., mean and standard deviation in normal distribution, success probability and number of trials in binary distribution, location and scale in logistic distribution, etc. &ndash; are what God knows.
 
And each distribution has each relevant equation to describe its probability distribution, and the equation is derived from parameters.
 
===Probability===
When a sample is derived from a population, parameters of the sample can be calculated, and they are the estimates of parameters of the population.
 
But God knows the true parameters of the population, and parameter of the sample always have random error.
 
An equation relevant to each distribution is derived from the parameters of the sample, and a value the equation makes also has random error.
 
And that value the equation makes is ''probability''. ''Probability'' is the chance that a given observed data is included in the distribution. Or more specifically, it is conditional probability, because a given observed data (= condition) gives a probability of the existing of the data.
 
<!--
Chance that a value <math>x_i</math> in the sample is in a range of certain values (e.g., <math>x_i > n</math>) can also be calculated from the dataset. This chance is <math>probability</math>.
 
:<math>probability = P(x_i\text{ in a range}|\text{parameters})</math>
!-->
 
===Likelihood===
When a sample is derived from a population, observations in the sample should follow the distribution God knows with the parameters God knows, which are impossible to know.
 
And there are multiple possibility of sets of parameters which observed data in the sample can follow, and different sets of parameters have different chances to exist.
 
Those chances are ''likelihood''. A set of parameters can be followed by observed data in the sample with very low chance, another set of parameters can be followed by the sample with relatively high chance, and yet another set of parameters can be followed by the sample with the highest chance.
 
As a natural sense, the highest chance of set of parameters, i.e., the most likely set of parameters (the parameters with the maximum ''likelihood'') is taken into account and is used to make the relevant equation. That is the maximum likelihood estimation method.
 
On the contrary to that the above-mentioned (conditional) ''probability'' fixes the parameters (hypothesis) and varies observations (data), ''likelihood'' fixes observations (data) and varies parameters (hypothesis).

案内メニュー