Logo en.artbmxmagazine.com

Statistics applied to the definition of management indicators

Anonim

Probabilistic sampling: It is a sample process in which the elements are chosen by random methods, that is, the selection of the elements for the sample is carried out by random procedures and with known selection probabilities.

applied-statistics-indicators-management

SAMPLING

Sample selection methods

Probabilistic sampling: It is a sample process in which the elements are chosen by random methods, that is, the selection of the elements for the sample is carried out by random procedures and with known selection probabilities.

SAMPLING

Sample selection methods

1-Simple random sampling: It is a selection of samples in which the elements or units are chosen individually and directly through a random process, in which each unselected element has the same opportunity to be chosen as all the others elements in each sample extraction. So each item in the population must have an equal probability of being selected.

SAMPLING

Sample selection methods

2-Systematic sampling: The elements are selected in an ordered way and depend on the number of elements or units included in the population and the size of the sample. Requires the use of a list of all the elements of the population.

SAMPLING

Sample selection methods

3-Stratified sampling: The population is divided into a certain number of subgroups or strata, each of which is sampled independently. The process through which the population is divided into subgroups or strata, is called stratification. The purpose of the stratification is to carry out separate selections in each of the subgroups or strata.

SAMPLING

Sample selection methods

Non-probability sampling: Includes all methods in which the sample elements are not selected by random or random procedures, or with known selection probabilities. Some selection procedures for non-probability sampling are:

  • Judgment sampling: Quota sampling: decisional sampling: Causal grouping sampling:

STATISTICAL DATA

The data of quantitative characteristics: they are those that can be expressed numerically and are obtained through measurements and counts. A quantitative data can be found in any discipline; psychology, accounting, economics, advertising etc

STATISTICAL DATA

The data with quantitative and qualitative characteristics are classified in turn into:

1-Continuous variables:

2-Discrete variables:

STATISTICAL DATA

Data with qualitative characteristics: Data with qualitative characteristics are those that cannot be expressed numerically. These data must be converted to numerical values ​​before working with them.

STATISTICAL DATA

The qualitative characteristics data are classified into:

1-Nominal data: They include categories, such as sex, study career, floor material, qualifications, etc.

2-hierarchical data: subjective evaluations when concepts are ranked according to preference or achievement

STATISTICAL DATA

Frequency distribution: When you have a large mass or quantity of data, it is sometimes very difficult to group them.

This way of organizing information is called frequency distribution and consists of the ordering of the data through classes and frequencies.

STATISTICAL DATA

When the data are presented in a frequency distribution, they are called grouped data. When all the observed data of a variable are listed in a disorganized way, we will call it ungrouped data.

DISTRIBUTION OF

FREQUENCIES

Introduction

The preliminary phase of any statistical study is based on the collection and ordering of data; This is done with the help of numerical and graphical summaries.

Position measurements

These are the measures that help us know where the data is but without indicating how it is distributed.

Steps to build a distribution of frequencies: already known all the theoretical elements necessary for the construction and understanding of a distribution of frequencies we will proceed to show the steps required for its implementation.

3 ° Calculate the number of classes, provided that the class interval is known.

NC = R

Ci

As can be seen in the second and third steps, it would be very difficult to solve these equations by simple mathematical methods since each of them presents two unknowns.

In statistics, a histogram is a graphical representation of a variable in the form of bars, where the surface of each bar is proportional to the frequency of the represented values.

It is used when studying a continuous variable, such as age ranges or sample height, and, for convenience, its values ​​are grouped into classes, that is, continuous values. In cases where the data is qualitative (non-numerical), such as sixth grade agreement or educational level, a sector diagram is preferable.

Frequency polygon

A graph made by joining the midpoints at the top of the columns of a frequency histogram.

Bar graph

It is a set of rectangles or bars separated from each other, because it is used to represent discrete variables; the bars must be of the same base or width and separated at the same distance. They can be arranged vertically and horizontally.

Linear Graph

A line graph is used to represent data series that have been collected at a specific time. The data is plotted on a graph at time intervals and a line is drawn connecting the resulting points.

CIRCULAR GRAPH:

100% bar graph and circle graph: They are especially used to represent the parts into which a total quantity is divided.

The warhead:

The warhead: This graph consists of the representation of the accumulated frequencies of a frequency distribution. It can be built in two different ways; on the “less than” or on the “or more” basis. You can determine the median value of the distribution.

EXAMPLES TO DISCUSS

MEASURES OF ARITHMETIC AVERAGE POSITION

One of the measures that characterize a distribution is the arithmetic mean, which is obtained by adding the entered x i values, and dividing by the total number N

THE STATISTICAL MEDIUM

It is the value of the variable that leaves the same number of data before and after it, once they have been ordered. According to this definition, the data set less than or equal to the median will represent 50% of the data, and those that are greater than the median will represent the other 50% of the total sample data.

THE STATISTICAL MEDIUM

To understand this concept we are going to assume that we have the ordered series of values ​​(2, 5, 8, 10 and 13), the position of the median would be:

THE STATISTICAL MEDIUM

And if the ordered series were (2, 5, 8, 10,13 and 15), the position of the median would be:

That is, position three and a half. Since it is impossible to highlight the three and a half position, it is necessary to average the two values ​​of the third and fourth positions to produce an equivalent median, which in this case correspond to (8 + 10) / 2 = 9. Which would indicate that half of the values ​​are below the value 9 and the other half is above this value.

FASHION

FASHION

The modal measure indicates the value that is most often repeated within the data; that is, if we have the ordered series (2, 2, 5 and 7), the value that is repeated most often is number 2 who would be the mode of the data. It is possible that in some occasions two values ​​are presented with the highest frequency, which is called Bimodal or in other cases more than two values, which is known as multimodal.

Measures of central tendency

Measures of central tendency

They allow us to identify the most representative values ​​of the data, according to the way they tend to be concentrated.

The Average indicates the average of the data; that is, it informs us of the value that each of the individuals would obtain if the values ​​were distributed in equal parts.

Measures of central tendency

The Median, by contrast, informs us of the value that separates the data into two equal parts, each of which has fifty percent of the data.

Finally, Fashion indicates the value that is most repeated within the data

POSITION MEASURES POSITION MEASURES

Position measurements divide a data set into groups with the same number of individuals.

To calculate the position measurements it is necessary that the data are ordered from least to greatest.

POSITION MEASURES

Quartiles

The quartiles are the three values ​​of the variable that divide an ordered data set into four equal parts.

Q1, Q2 and Q3 determine the values ​​corresponding to 25%, 50% and 75% of the data.

Q2 coincides with the median.

POSITION MEASURES

Deciles

Deciles are the nine values ​​that divide the data series into ten equal parts.

The deciles give the values ​​corresponding to 10%, 20%… and 90% of the data.

D5 coincides with the median.

POSITION MEASURES

Percentiles

Percentiles are the 99 values ​​that divide the data series into 100 equal parts.

The percentiles give the values ​​corresponding to 1%, 2%… and 99% of the data.

P50 coincides with the median.

MEASUREMENTS

OF

DISPERSION

MEASURES OF DISPERSION

MEASURES OF DISPERSION

The range is the difference between the largest and smallest of the data in a distribution

statistics

MEASURES OF DISPERSION

The deviation from the mean is the difference between each value of the statistical variable and the arithmetic mean.

D i = x - x

MEASURES OF DISPERSION

The mean deviation is the arithmetic mean of the absolute values ​​of the deviations from the mean

The mean deviation is represented by

MEASURES OF DISPERSION

Variance Properties

  • The variance will always be a positive value or zero, in the event that scores are iguales.Si all values of the variable are added one number the variance does not change.

MEASURES OF DISPERSION

Variance Properties

3 If all the values of the variable are multiplied by a number, the variance is multiplied by the square of that number.

MEASURES OF DISPERSION

Variance Properties

4 If we have several distributions with the same mean and we know their respective variances, the total variance can be calculated.

MEASURES OF DISPERSION

Observations on variance

  • The variance, equal to the average, is a very sensitive index scores extremas.En cases can not find the media it will not be possible to find the variance.

MEASURES OF DISPERSION

Observations on variance

3 The variance is not expressed in the same units as the data, since the deviations are squared.

MEASURES OF DISPERSION

Standard deviation

The standard deviation (σ) measures how far the data is separated.

The formula is easy: it is the square root of the variance. So "what is variance?"

MEASURES OF DISPERSION

Standard deviation Variance

The variance (which is the square of the standard deviation: σ 2) is defined as:

It is the mean of the squared mean differences.

MEASURES OF DISPERSION

Standard deviation

In other words, follow these steps:

Calculate the mean (the average of the numbers)

MEASURES OF DISPERSION

Standard deviation

Now, for each number, subtract the mean and square the result (the squared difference).

MEASURES OF DISPERSION

Standard deviation

Now calculate the mean of those differences squared. (Why squared?)

MEASURES OF DISPERSION

Standard deviation

(Why squared?)

Squared each difference makes all numbers positive (to prevent negative numbers from reducing variance)

MEASURES OF DISPERSION

Standard deviation

And they also make big differences stand out. For example 100 2 = 10,000 is much larger than 50 2 = 2,500.

But squaring them makes the answer very large, so we undo it (with the square root) and so the standard deviation is much more useful

MEASURES OF DISPERSION

Download the original file

Statistics applied to the definition of management indicators