#### Where do histograms come from? Do you know?

StatQuest: Histograms, Clearly Explained

My cat does stats while she sleeps.

I like to do stats when I'm awake. How about you?

StatQuest!

Hello, and welcome to StatQuest.

StatQuest is brought to you by the friendly folks in the genetics department

at the University of North Carolina at Chapel Hill

Today we’re gonna be talking about histograms, and they’re gonna be clearly explained.

Imagine we went out and measured someone

and they were this tall

and then we measured someone else.

Then we measured a whole bunch of people.

We've measured so many people that the dots overlap,

and some dots are completely hidden.

We could try to make it easier to see the hidden measurements

by stacking any that are exactly the same

But measurements that are exactly the same are rare,

and a lot of the hidden measurements are still hidden.

So instead of stacking measurements that are exactly the same,

we divide the range of values into bins

and stack the measurements that fall in the same bin.

This, my friends, is a histogram.

Bam

The taller the stack within a bin

the more measurements we made that fall into that bin.

Duh
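The binning and stacking described above can be sketched in a few lines of Python. This is only a minimal illustration: the heights are invented, and the function name `histogram`, the bin width of 10 and the starting point of 150 are all assumptions, not anything from the video.

```python
import math
from collections import Counter

def histogram(measurements, bin_width, origin=0.0):
    """Count how many measurements fall in each bin.

    Bin k covers the half-open interval
    [origin + k * bin_width, origin + (k + 1) * bin_width).
    """
    counts = Counter()
    for x in measurements:
        k = math.floor((x - origin) / bin_width)
        counts[k] += 1
    return dict(counts)

# Hypothetical heights in centimetres, invented for illustration.
heights = [150.2, 151.9, 158.4, 161.0, 162.3, 163.7, 164.1, 166.8, 171.5, 183.0]

counts = histogram(heights, bin_width=10, origin=150)

# Print a text histogram: one "o" per stacked measurement.
for k in sorted(counts):
    low = 150 + k * 10
    print(f"{low}-{low + 10}: {'o' * counts[k]}")
```

Each row of the printout is one bin, and the longer the row of `o` characters, the more measurements landed in that bin.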

we can use the histogram to predict the probability

of getting future measurements

I would be willing to bet that the next measurement we make is somewhere in this range.

Measurements out here are rarer

and less likely to happen in the future
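One rough way to turn stack heights into probabilities is to divide each bin's count by the total number of measurements. The sketch below assumes the same invented heights and bin settings as before; `bin_probabilities` is a hypothetical helper name.

```python
import math
from collections import Counter

def bin_probabilities(measurements, bin_width, origin=0.0):
    """Estimate the chance that a future measurement lands in each bin
    as the fraction of past measurements that landed there."""
    counts = Counter(math.floor((x - origin) / bin_width) for x in measurements)
    total = len(measurements)
    return {k: c / total for k, c in counts.items()}

# Hypothetical heights in centimetres, invented for illustration.
heights = [150.2, 151.9, 158.4, 161.0, 162.3, 163.7, 164.1, 166.8, 171.5, 183.0]

probs = bin_probabilities(heights, bin_width=10, origin=150)

# The bin with the tallest stack is the best bet for the next measurement.
most_likely = max(probs, key=probs.get)
low = 150 + most_likely * 10
print(f"Best bet: {low}-{low + 10} cm, estimated probability {probs[most_likely]:.2f}")
```

Bins with short stacks get small fractions, which matches the idea that measurements out in the tails are rarer.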

If you want to use a distribution to approximate your data or future measurements,

histograms are a good way to justify your decision.

By the way

if you don’t know what a distribution is

there's a StatQuest for that.

In this case

we might use a normal distribution

to approximate the data and future measurements

If the data look like this,

we might use an exponential distribution

to approximate this data and future measurements
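To check whether a normal distribution is a reasonable approximation, one option is to compare the fraction of measurements in each bin with the fraction a fitted normal curve would predict. The sketch below is an assumption-laden illustration: the data are simulated with `random.gauss`, the bin width of 5 is arbitrary, and the fit uses `NormalDist` from Python's standard `statistics` module.

```python
import math
import random
from collections import Counter
from statistics import NormalDist

# Simulated heights (assumption: mean 170 cm, standard deviation 10 cm).
random.seed(0)
heights = [random.gauss(170, 10) for _ in range(5000)]

# Fit a normal distribution using the sample mean and standard deviation.
fitted = NormalDist.from_samples(heights)

bin_width = 5
counts = Counter(math.floor(x / bin_width) for x in heights)

# For each bin, compare the observed fraction with the fitted prediction.
comparison = {}
for k in sorted(counts):
    low, high = k * bin_width, (k + 1) * bin_width
    observed = counts[k] / len(heights)
    expected = fitted.cdf(high) - fitted.cdf(low)
    comparison[low] = (observed, expected)
    print(f"{low}-{high}: observed {observed:.3f}, normal predicts {expected:.3f}")
```

If the two columns track each other closely, the histogram supports using the normal distribution. The same comparison could be made for an exponential distribution by swapping in its cumulative distribution function.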

Note

figuring out how wide to make the bins is tricky

If the bins are too narrow, then they are not much help.

In this case the bins are so narrow

that pretty much every measurement gets its own bin

This doesn’t give us much more insight than what we had before

so it’s not very useful

And if the bins are too wide

they are not much help

In this case the bins are so wide

that the measurements are split 50/50

All this tells us is how many measurements are above the average,

and how many are below

this is more insight than before,

but we can do better

Sometimes you have to try a bunch of different bin widths

before you get a clear picture

In other words

don’t rely on the default setting of whatever program you’re using to draw the histogram

You've got to try a bunch of different settings

before you're sure that you've got the best histogram you can draw.
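The effect of bin width can be seen by binning the same data several ways. In this sketch the data are simulated, the three widths (0.01, 5 and 100) are arbitrary choices, and the bins are anchored at 170 so that the widest setting splits the data around the assumed average.

```python
import math
import random
from collections import Counter

def histogram(measurements, bin_width, origin):
    """Count measurements per bin; bin k starts at origin + k * bin_width."""
    return Counter(math.floor((x - origin) / bin_width) for x in measurements)

# Simulated heights (assumption: mean 170 cm, standard deviation 10 cm).
random.seed(1)
heights = [random.gauss(170, 10) for _ in range(200)]

results = {}
for bin_width in (0.01, 5, 100):
    counts = histogram(heights, bin_width, origin=170)
    results[bin_width] = counts
    print(f"bin width {bin_width}: {len(counts)} bins, "
          f"tallest stack holds {max(counts.values())} measurements")
```

With the narrowest width, nearly every measurement ends up in its own bin. With the widest, there are only two bins, one below 170 and one above. The middle setting gives a handful of bins with clearly different stack heights.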

Hooray,

we’ve made it to the end of another exciting StatQuest

If you like this StatQuest and wanna see more like it,

and if you have any suggestions for future StatQuests,

just let me know in the comments below. Until next time,

quest on!
