Sie sind auf Seite 1von 38

Thursday,December22,201606:21:58PM1

CREATING GRAPHS FOR THE GAPMINDER DATA


The following table shows the main variables used in the analysis. It has been found missing data particularly in the variable
OILPERPERSON. Also, it shows a SECUNDARY variable created FEMEMPLOYED, aiming to analyze the relationship between female and
male over 15 years old employed within the employed population during 2007. The variable FEMEMPLOYED had 3 figures with the value
more than 1 (bold color figures on the table below), which corresponded to the countries Burundi, Rwanda and Mozambique, these values had
been set to as missing values through recoding the variable. Extending the analysis from the last work-paper posted, there have been added plot
that show visually the relationship between some QUANTITATIVE VARIABLES (dependent variables) and CATEGORICAL VARIABLES (independent
variables) in my Gapminder Code book selection.

Obs Country
1 Afghanistan

incomeperperson oilperperson employrate femaleemployrate FEMEMPLOYED


.

55.7000

25.6000

0.45961

2 Albania

1915.00

51.4000

42.1000

0.81907

3 Algeria

2231.99

0.4201

50.5000

31.7000

0.62772

4 Andorra

21943.34

5 Angola

1381.00

75.7000

69.4000

0.91678

6 AntiguaandBarbuda

11894.46

7 Argentina

10749.42

0.6359

58.4000

45.9000

0.78596

8 Armenia

1326.74

40.1000

34.2000

0.85287

10 Australia

25249.99

1.9130

61.5000

54.6000

0.88780

11 Austria

26692.98

1.5488

57.1000

49.7000

0.87040

2344.90

0.3592

60.9000

56.2000

0.92282

13 Bahamas

19630.54

66.6000

60.7000

0.91141

14 Bahrain

12505.21

60.4000

30.2000

0.50000

558.06

0.0323

68.1000

53.6000

0.78708

16 Barbados

9243.59

66.9000

60.3000

0.90135

17 Belarus

2737.67

0.6878

53.4000

48.6000

0.91011

18 Belgium

24496.05

48.6000

41.7000

0.85802

19 Belize

3545.65

56.8000

38.8000

0.68310

20 Benin

377.04

71.6000

58.2000

0.81285

62682.15

22 Bhutan

1324.19

58.4000

39.9000

0.68322

23 Bolivia

1232.79

70.4000

61.6000

0.87500

9 Aruba

12 Azerbaijan

15 Bangladesh

21 Bermuda

Thursday,December22,201606:21:58PM2

Obs Country

incomeperperson oilperperson employrate femaleemployrate FEMEMPLOYED

24 BosniaandHerzegovina

2183.34

41.2000

34.9000

0.84709

25 Botswana

4189.44

46.0000

38.7000

0.84130

26 Brazil

4699.41

0.5997

64.5000

53.3000

0.82636

27 Brunei

17092.46

63.8000

55.5000

0.86991

2549.56

0.5604

47.3000

42.1000

0.89006

29 BurkinaFaso

276.20

81.3000

75.8000

0.93235

30 Burundi

115.31

83.2000

83.3000

1.001202

31 Cambodia

557.95

78.9000

73.4000

0.93029

32 Cameroon

713.64

59.1000

49.0000

0.82910

25575.35

3.0074

63.5000

58.9000

0.92756

1959.84

55.9000

43.6000

0.77996

36 CentralAfricanRep.

239.52

71.3000

62.9000

0.88219

37 Chad

275.88

68.9000

66.3000

0.96226

38 Chile

6334.11

0.8590

51.0000

35.5000

0.69608

39 China

2425.47

0.3195

72.8000

68.9000

0.94643

40 Colombia

3233.42

0.2376

63.2000

54.9000

0.86867

41 Comoros

336.37

68.0000

59.3000

0.87206

42 Congo,Dem.Rep.

103.78

66.2000

50.4000

0.76133

43 Congo,Rep.

1253.29

64.2000

52.3000

0.81464

44 CookIslands

5188.90

58.6000

40.1000

0.68430

591.07

59.9000

38.1000

0.63606

47 Croatia

6338.49

47.1000

39.4000

0.83652

48 Cuba

4495.05

56.0000

43.7000

0.78036

15313.86

59.1000

51.3000

0.86802

7381.31

0.8768

56.0000

47.6000

0.85000

30532.28

1.5675

63.1000

58.1000

0.92076

895.32

53 Dominica

6147.78

54 DominicanRep.

4049.17

52.7000

40.5000

0.76850

55 Ecuador

1728.02

0.7328

59.7000

45.5000

0.76214

56 Egypt

1975.55

0.4475

42.4000

18.2000

0.42925

28 Bulgaria

33 Canada
34 CapeVerde
35 CaymanIslands

45 CostaRica
46 Coted'Ivoire

49 Cyprus
50 CzechRep.
51 Denmark
52 Djibouti

Thursday,December22,201606:21:58PM3

Obs Country

incomeperperson oilperperson employrate femaleemployrate FEMEMPLOYED

57 ElSalvador

2557.43

58.2000

44.8000

0.76976

58 EquatorialGuinea

8654.54

61.7000

39.6000

0.64182

59 Eritrea

131.80

64.9000

51.7000

0.79661

60 Estonia

6238.54

56.5000

52.1000

0.92212

61 Ethiopia

220.89

80.7000

73.0000

0.90458

2230.68

56.4000

36.8000

0.65248

64 Finland

27110.73

1.9387

57.2000

53.4000

0.93357

65 France

22878.47

1.3283

51.2000

45.6000

0.89062

67 Gabon

4180.77

59.0000

52.6000

0.89153

68 Gambia

354.60

71.7000

65.7000

0.91632

69 Georgia

1258.76

55.6000

48.5000

0.87230

70 Germany

25306.19

1.3985

53.5000

46.8000

0.87477

358.98

65.1000

64.1000

0.98464

73 Greece

13577.88

1.6286

49.6000

37.8000

0.76210

74 Greenland

20751.89

5330.40

76 Guadeloupe

43.1000

38.3000

0.88863

77 Guam

1860.75

62.7000

44.0000

0.70175

79 Guinea

411.50

81.5000

77.6000

0.95215

80 GuineaBissau

161.32

65.9000

49.9000

0.75721

1200.65

58.9000

41.8000

0.70968

371.42

55.9000

35.4000

0.63327

1392.41

56.3000

34.2000

0.60746

35536.07

2.2827

59.0000

51.6000

0.87458

85 Hungary

5634.00

0.6711

47.3000

40.3000

0.85201

86 Iceland

33945.31

73.6000

69.6000

0.94565

786.70

0.1270

55.4000

32.3000

0.58303

88 Indonesia

1143.83

0.2485

61.5000

44.1000

0.71707

89 Iran

2161.55

1.1626

47.8000

26.9000

0.56276

62 FaeroeIslands
63 Fiji

66 FrenchPolynesia

71 Ghana
72 Gibraltar

75 Grenada

78 Guatemala

81 Guyana
82 Haiti
83 Honduras
84 HongKong,China

87 India

Thursday,December22,201606:21:58PM4

Obs Country
90 Iraq

incomeperperson oilperperson employrate femaleemployrate FEMEMPLOYED


736.27

37.4000

12.4000

0.33155

91 Ireland

27595.09

1.7003

59.9000

51.0000

0.85142

92 Israel

22275.75

51.3000

46.4000

0.90448

93 Italy

18982.27

1.2072

46.4000

35.8000

0.77155

3665.35

58.2000

47.5000

0.81615

95 Japan

39309.48

1.5932

57.3000

46.2000

0.80628

96 Jordan

2534.00

38.9000

13.0000

0.33419

97 Kazakhstan

2481.72

0.7800

63.7000

58.2000

0.91366

98 Kenya

468.70

73.2000

66.6000

0.90984

99 Kiribati

760.26

64.6000

56.0000

0.86687

16372.50

2.0878

58.9000

48.0000

0.81494

6.4676

65.7000

42.0000

0.63927

103 Kyrgyzstan

372.73

58.9000

48.8000

0.82852

104 Laos

554.88

78.2000

78.1000

0.99872

105 Latvia

5011.22

56.8000

50.7000

0.89261

106 Lebanon

6746.61

46.0000

22.3000

0.48478

107 Lesotho

495.73

56.3000

49.8000

0.88455

108 Liberia

155.03

66.0000

53.2000

0.80606

109 Libya

7885.47

48.6000

23.2000

0.47737

81647.10

5332.24

0.8124

53.1000

48.8000

0.91902

112 Luxembourg

52301.59

53.5000

45.9000

0.85794

113 Macao,China

33923.31

63.6000

57.5000

0.90409

2221.19

34.9000

26.8000

0.76791

115 Madagascar

242.68

83.0000

79.2000

0.95422

116 Malawi

184.14

71.8000

69.0000

0.96100

117 Malaysia

5184.71

0.8908

60.5000

43.1000

0.71240

118 Maldives

4038.86

56.9000

42.1000

0.73989

269.89

45.7000

34.4000

0.75274

11066.78

46.8000

30.1000

0.64316

2437.28

42.5000

38.2000

0.89882

94 Jamaica

100 Korea,Dem.Rep.
101 Korea,Rep.
102 Kuwait

110 Liechtenstein
111 Lithuania

114 Macedonia,FYR

119 Mali
120 Malta
121 MarshallIslands
122 Martinique

Thursday,December22,201606:21:58PM5

Obs Country

incomeperperson oilperperson employrate femaleemployrate FEMEMPLOYED

123 Mauritania

609.13

46.9000

45.3000

0.96588

124 Mauritius

5182.14

54.5000

36.3000

0.66606

125 Mexico

6105.28

0.7706

57.9000

39.6000

0.68394

126 Micronesia,Fed.Sts.

2146.36

595.87

44.3000

43.6000

0.98420

105147.44

772.93

52.1000

50.9000

0.97697

130 Montenegro

2222.34

131 Morocco

1844.35

46.2000

22.2000

0.48052

389.76

77.0000

82.2000

1.067532

74.7000

66.5000

0.89023

2667.25

42.0000

37.3000

0.88810

135 Nauru

136 Nepal

268.26

61.8000

54.6000

0.88350

26551.84

2.9977

61.3000

54.3000

0.88581

138 NetherlandsAntilles

53.5000

47.1000

0.88037

139 NewCaledonia

14778.16

1.5796

65.0000

58.3000

0.89692

141 Nicaragua

948.36

58.8000

36.0000

0.61224

142 Niger

180.08

60.4000

38.0000

0.62914

143 Nigeria

544.60

50.9000

36.5000

0.71709

145 Norway

39972.35

2.1912

65.0000

60.9000

0.93692

146 Oman

11191.81

50.7000

22.6000

0.44576

668.55

0.1181

51.0000

19.0000

0.37255

148 Palau

6243.57

149 Panama

5900.62

59.8000

43.4000

0.72575

744.24

70.4000

68.9000

0.97869

151 Paraguay

1621.18

73.1000

65.3000

0.89330

152 Peru

3180.43

0.2889

68.3000

59.8000

0.87555

153 Philippines

1383.40

0.1405

61.0000

46.8000

0.76721

154 Poland

6575.75

0.6871

48.7000

42.1000

0.86448

11744.83

1.1803

57.6000

50.6000

0.87847

127 Moldova
128 Monaco
129 Mongolia

132 Mozambique
133 Myanmar
134 Namibia

137 Netherlands

140 NewZealand

144 Niue

147 Pakistan

150 PapuaNewGuinea

155 Portugal

Thursday,December22,201606:21:58PM6

Obs Country

incomeperperson oilperperson employrate femaleemployrate FEMEMPLOYED

156 PuertoRico

15822.11

42.4000

34.6000

0.81604

157 Qatar

33931.83

4.2074

76.0000

39.6000

0.52105

158 Reunion

44.2000

37.9000

0.85747

159 Romania

2636.79

0.4235

49.5000

43.8000

0.88485

160 Russia

2923.14

1.0325

58.8000

53.8000

0.91497

338.27

79.8000

80.5000

1.008772

162 SaintKittsandNevis

9175.80

163 SaintLucia

5248.58

164 SaintVincentandtheGrenadines

4885.05

165 Samoa

1784.07

31993.20

9425.33

4.5723

51.2000

17.7000

0.34570

561.71

65.6000

56.9000

0.86738

1194.71

48.7000

41.1000

0.84394

8614.12

268.33

63.8000

63.4000

0.99373

32535.83

12.2286

62.4000

51.3000

0.82212

8445.53

0.6774

53.4000

45.3000

0.84831

12729.45

55.9000

49.4000

0.88372

1144.10

65.0000

51.3000

0.78923

66.0000

50.7000

0.76818

3745.65

0.5047

41.1000

34.3000

0.83455

15461.76

1.6169

52.5000

41.7000

0.79429

1295.74

55.1000

39.2000

0.71143

523.95

47.3000

27.9000

0.58985

183 Suriname

2668.02

44.7000

30.4000

0.68009

184 Swaziland

1810.23

50.9000

47.1000

0.92534

185 Sweden

32292.48

1.5459

60.7000

56.7000

0.93410

186 Switzerland

37662.75

1.4874

64.3000

57.0000

0.88647

1525.78

44.8000

16.7000

0.37277

2.0065

54.5000

47.1000

0.86422

161 Rwanda

166 SanMarino
167 SaoTomeandPrincipe
168 SaudiArabia
169 Senegal
170 Serbia
171 SerbiaandMontenegro
172 Seychelles
173 SierraLeone
174 Singapore
175 SlovakRepublic
176 Slovenia
177 SolomonIslands
178 Somalia
179 SouthAfrica
180 Spain
181 SriLanka
182 Sudan

187 Syria
188 Taiwan

Thursday,December22,201606:21:58PM7

Obs Country

incomeperperson oilperperson employrate femaleemployrate FEMEMPLOYED

189 Tajikistan

279.18

54.6000

50.1000

0.91758

190 Tanzania

456.39

78.2000

76.1000

0.97315

191 Thailand

2712.52

0.7262

72.0000

65.0000

0.90278

192 TimorLeste

369.57

67.3000

54.7000

0.81278

193 Togo

285.22

63.9000

48.4000

0.75743

194 Tonga

2025.28

10480.82

61.5000

50.5000

0.82114

196 Tunisia

3164.93

41.6000

21.4000

0.51442

197 Turkey

5348.60

0.3945

42.8000

21.9000

0.51168

198 Turkmenistan

2062.13

1.1107

58.5000

53.9000

0.92137

199 Tuvalu

1714.94

200 Uganda

377.42

83.2000

80.0000

0.96154

201 Ukraine

1036.83

0.2552

54.4000

49.4000

0.90809

202 UnitedArabEmirates

21087.39

4.3000

75.2000

37.3000

0.49601

203 UnitedKingdom

28033.49

1.1880

59.3000

53.1000

0.89545

204 UnitedStates

37491.18

2.7385

62.3000

56.0000

0.89888

9106.33

57.5000

46.0000

0.80000

952.83

0.1822

57.5000

52.6000

0.91478

207 Vanuatu

1543.96

208 Venezuela

5528.36

1.2146

59.9000

45.8000

0.76461

722.81

71.0000

67.6000

0.95211

32.0000

11.3000

0.35313

211 Yemen,Rep.

610.36

39.0000

20.3000

0.52051

212 Zambia

432.23

61.0000

53.5000

0.87705

213 Zimbabwe

320.77

66.8000

58.1000

0.86976

195 TrinidadandTobago

205 Uruguay
206 Uzbekistan

209 Vietnam
210 WestBankandGaza

Thursday,December22,201606:21:58PM8

TheCONTENTSProcedure

Attributesofthevariablesanalyzedfollowonthetablebelow:FEMEMPLOYEDexplainsthepercentageofwomenemployedinrelationwith
thetotalemployedinthepopulation(women+men),andFEMEMPLOYED_DIFFexplainshowmanymorewomenareemployedthanmen
inpercentage,inotherwords,therelationshipbetweenwomenandmenemployedinpercentage.

AlphabeticListofVariablesandAttributes
# Variable

Type Len Label

6 FEMEMPLOYED

Num

8 PERCENTAGEOFWOMENEMPLOYEDINTHETOTALEMPLOYEDPOPULATION,
AGE15+

Char

5 percentageemployedfemaleoverorunderemployedmale,Age15+

Char

2 quartilesofTotalEmployeesAge15+

11 FEMEMPLOYED_DIFF
9 QUARTILE_EMPLR

10 QUARTILE_FEMEMPLR Char

2 quartilesofFemaleEmployeesAge15+

7 QUARTILE_GDPPC

Char

2 quartilesofGrossDomesticProductpercapita

8 QUARTILE_OILPC

Char

2 quartilesofOilConsumptionperCapita

1 country

Char

46 COUNTRIESAROUNDTHEWORLD

5 employrate

Num

8 TOTALEMPLOYEESAGE15+(EMPLR),YEAR2007

3 femaleemployrate

Num

8 FEMALEEMPLOYEESAGE15+(FEMEMPLR),YEAR2007

2 incomeperperson

Num

8 GROSSDOMESTICPRODUCTPERCAPITA(GDPPC),YEAR2010

4 oilperperson

Num

8 OILCONSUMPTIONPERCAPITA(OILPC),YEAR2010

Thefollowing4tablesshowsthequartilesofINCOMEPERPERSONwith23missingvalues,OILPERPERSONwith150missingvalues,
EMPLOYRATEwith35missingvaluesandFEMALEEMPLOYRATEwith35missingvaluesaswell.Quartileshadbeenchosentoknowin
whichquartilethevariableismoregrouped.

quartilesofGrossDomesticProductpercapita
Cumulative Cumulative
QUARTILE_GDPPC Frequency Percent Frequency
Percent
Q1

47

24.74

47

24.74

Q2

48

25.26

95

50.00

Q3

48

25.26

143

75.26

Q4

47

24.74

190

100.00

FrequencyMissing=23

Thursday,December22,201606:21:58PM9

TheCONTENTSProcedure
quartilesofOilConsumptionperCapita

Cumulative Cumulative
QUARTILE_OILPC Frequency Percent Frequency
Percent
Q1

16

25.40

16

25.40

Q2

16

25.40

32

50.79

Q3

16

25.40

48

76.19

Q4

15

23.81

63

100.00

FrequencyMissing=150
quartilesofTotalEmployeesAge15+
Cumulative Cumulative
QUARTILE_EMPLR Frequency Percent Frequency
Percent
Q1

43

24.16

43

24.16

Q2

46

25.84

89

50.00

Q3

44

24.72

133

74.72

Q4

45

25.28

178

100.00

FrequencyMissing=35
quartilesofFemaleEmployeesAge15+
Cumulative Cumulative
QUARTILE_FEMEMPLR Frequency Percent Frequency
Percent
Q1

44

24.72

44

24.72

Q2

45

25.28

89

50.00

Q3

44

24.72

133

74.72

Q4

45

25.28

178

100.00

FrequencyMissing=35

Thefollowingtable,FEMEMPLOYED_DIFF,showsthatin161countriesthenumberofwomenemployedexceedmen
employedonage15+,during2007,whichrepresents92%ofthecountriesconsideredinthesampleasshowsinthetablebelow.
Although,therewere35missingvaluesintheoriginaldatatable,hadbeenset3extravaluestoasmissingvalues(totalmissing
values:35+3=38)duetotheunderstandingthatcouldnotbereasonablethefactthatmorewomenwereworkingthanthetotal
actuallypopulationin2007.Thosevaluescouldbewrongdataorduetotherecollectionofthedataindifferenttimes.

TheCONTENTSProcedure

Thursday,December22,201606:21:58PM10

percentageemployedfemaleoverorunderemployedmale,Age15+
Cumulative Cumulative
FEMEMPLOYED_DIFF Frequency Percent Frequency
Percent
OVER
UNDER

161

92.00

161

92.00

14

8.00

175

100.00

FrequencyMissing=38

Thefollowingplotshowsvisuallytherelationshipbetweentwovariables:OILCONSUMPTIONPERCAPITAand
GROSSDOMESTICPRODUCTPERCAPITA,in2010.Theformerisconsideredthedependentvariableandthelatest
theexplanatoryvariable.Itseemstobeapositiverelationship,whichisthefirsthypothesisatthebeginningofthe
Gapminderdataanalysis.Thus,incrementinGDPPCcausesthevariableOILPCtoincreaseaswell.Although,itisno
clearonthisplot,ithasbeenplottedasecondgraphtransformingthequantitativeexplanatoryvariable(GDPPC)ina
categoricalvariabledividedinquartiles(4groups).

TheCONTENTSProcedure

Thursday,December22,201606:21:58PM11

ItisclearinthisplotthepositiverelationshipbetweenGDPPCandOILPCin2010inallcountrieswithinthesample.Note
thatthefirstgroupquartile(Q1)representsthecountrieswithlowestGrossDomesticProductpercapita,whichconsume
lessoilintheworld.Thegroup4(Q4)representsthecountrieswithmayorGDPPCintheworld,thusmayorconsumption
ofoil.Theplotisskewedtotheleft(lefttail),ithasonemode(unimodal),withitshighestpeakonthequartileQ4whichis
layingbetween75%and100%oftheGrossDomesticProductpercapita.

TheCONTENTSProcedure

Thursday,December22,201606:21:58PM12

Thefollowingplotshowsvisuallytherelationshipbetweentwovariables:TOTALEMPLOYEESAGE15+andGROSS
DOMESTICPRODUCTPERCAPITA,in2010.Theformerisconsideredthedependentvariableandthelatestthe
explanatoryvariable.Itisdifficulttoseeapositiveornegativerelationship.Although,itisnoclearonthisplot,ithasbeen
plottedasecondgraphtransformingthequantitativeexplanatoryvariable(GDPPC)intoacategoricalvariabledividedin
quartiles(4groups).

TheCONTENTSProcedure

Thursday,December22,201606:21:58PM13

ItisclearinthisplotthatthereisnotasignificantpositiveornegativerelationshipbetweenGDPPCandEMPLRinall
countrieswithinthesample.Notethatthefirstgroupquartile(Q1)representsthecountrieswithlowestGrossDomestic
Productpercapita,whichgivesemploymenttomostpeople(moreemploymentrate)withintheirowncountry.Inother
words,upto25%ofthecountries(Q1)inthesampleworld,morepeopleareemployedineachofthesecountries,even
thougharethepoorestcountrieswithlessincome.Theplotseemstobeskewedtotheright(righttail),ithasonemode
(unimodal),withitshighestpeakonthequartileQ1whichislayingbetween0%and25%oftheGrossDomesticProduct
percapita.

TheCONTENTSProcedure

Thursday,December22,201606:21:58PM14

Thefollowingplotshowsvisuallytherelationshipbetweentwovariables:FEMALEEMPLOYEESAGE15+andGROSS
DOMESTICPRODUCTPERCAPITA.Theformerisconsideredthedependentvariableandthelatesttheexplanatory
variable.Itisdifficulttoseeapositiveornegativerelationshipasitwasontheprecedingplot.Although,itisnoclearon
thisplot,ithasbeenplottedasecondgraphtransformingthequantitativeexplanatoryvariable(GDPPC)intoa
categoricalvariabledividedinquartiles(4groups).

TheCONTENTSProcedure

Thursday,December22,201606:21:58PM15

ItisclearinthisplotthatthereisnotasignificantpositiveornegativerelationshipbetweenGDPPCandFEMEMPLRin
allcountrieswithinthesample.Notethatthefirstgroupquartile(Q1)representsthecountrieswithlowestGrossDomestic
Productpercapita,whichgivesemploymenttomostwomen(morewomenemploymentrate)withintheirowncountry.In
otherwords,upto25%ofthecountries(Q1)inthesampleworld,morewomenareemployedineachofthesecountries,
eventhougharethepoorestcountrieswithlessincome.Theplotseemstobeskewedtotheright(righttail),ithasone
mode(unimodal),withitshighestpeakonthequartileQ1whichislayingbetween0%and25%oftheGrossDomestic
Productpercapita.

TheCONTENTSProcedure

Thursday,December22,201606:21:58PM16

Thefollowingplotshowsvisuallytherelationshipbetweentwovariables:OILCONSUMPTIONPERCAPITAand
FEMALEEMPLOYEESAGE15+.Theformerisconsideredthedependentvariableandthelatesttheexplanatory
variable.Itisdifficulttoseeapositiveornegativerelationship.Although,itisnoclearonthisplot,ithasbeenplotteda
secondgraphtransformingthequantitativeexplanatoryvariable(FEMEMPLR)intoacategoricalvariabledividedin
quartiles(4groups).

TheCONTENTSProcedure

Thursday,December22,201606:21:58PM17

ItisclearinthisplotthatthereisnotasignificantpositiveornegativerelationshipbetweenOILPCandFEMEMPLRin
allcountrieswithinthesample.Notethatthethirdgroupquartile(Q3)representsthecountrieswiththemeanaverageof
GrossDomesticProductpercapitaof5365.16.Thesecountrieshavethehighestoilconsumptionpercapita.Inother
words,25%ofthecountries(Q3)intheworldsample,consumemoreoilthanintheotherquartiles.Theplotseemstobe
skewedtotheleft(lefttail),ithasonemode(unimodal),withitshighestpeakonthequartileQ3whichislayingbetween
50%and75%oftheFEMALEEMPLOYEESAGE15+.

TheCONTENTSProcedure

Thursday,December22,201606:21:58PM18

Thetablebelowshowtheactuallyfiguresofeachquartileineachvariable.ThevariableINCOMEPERPERSONshowsthatinthe
firstquartile(Q1),thereare47countriesthatrepresentameanof392.29US$percapitawhichisfarmorethantherestofthe
countriesintheotherquartilestogether(75%)

TheCONTENTSProcedure

Thursday,December22,201606:21:58PM19

AnalysisVariable:incomeperpersonGROSSDOMESTICPRODUCTPERCAPITA(GDPPC),
YEAR2010
quartilesofGrossDomestic
Productpercapita

N
Obs

Minimum Maximum

Q1

47

103.7758572 736.2680538

18437.84 392.2944107 172.5321401

Q2

48

744.2394132

2549.56

78970.56

Q3

48

2557.43

9425.33

257527.89

5365.16

2012.60

Q4

47

10480.82

105147.44 1305847.26

27783.98

18032.95

Sum

Mean

StdDev

1645.22 542.9698510

ThevariableOILPERPERSONshowsthat15countrieshavethehigheroilconsumptionwithameanof3.64tonnesperyearperperson.

AnalysisVariable:oilperpersonOILCONSUMPTIONPERCAPITA(OILPC),YEAR2010
quartilesofOilConsumption
perCapita

N
Obs Minimum Maximum

Sum

Mean

StdDev

Q1

16

0.0322815

0.5046595

4.4991570 0.2811973 0.1373993

Q2

16

0.5604235

1.0324699 12.0004301 0.7500269 0.1223396

Q3

16

1.1106715

1.6286149 22.3581617 1.3973851 0.1928498

Q4

15

1.7002618 12.2286450 54.6396164 3.6426411 2.7212177

ThevariablesEMPLOYRATEandFEMALEEMPLOYRATEhavesimilarpatterns,bothconcentratedinthequartileQ4andwelldistributeon
theotherquartilesaswell.Thesevariablesaremeasurein2007forapopulationofAge15+.Thefollowingtablewillexaminetherelationship
betweenfemaleandmaleemployedin2007(variableFEMEPLOYED).

AnalysisVariable:employrateTOTALEMPLOYEESAGE15+(EMPLR),YEAR2007
quartilesofTotalEmployees
Age15+

N
Obs Minimum Maximum

Sum

Mean

StdDev

Q1

43 32.0000000 51.0000000 1938.90 45.0906977 4.4861062

Q2

46 51.2000008 58.5999985 2550.70 55.4500002 2.1950196

Q3

44 58.7999992 64.9000015 2701.70 61.4022729 1.9614290

Q4

45 65.0000000 83.1999969 3245.90 72.1311108 5.7815464

TheCONTENTSProcedure

Thursday,December22,201606:21:58PM20

AnalysisVariable:femaleemployrateFEMALEEMPLOYEESAGE15+(FEMEMPLR),
YEAR2007
quartilesofFemaleEmployees
Age15+

N
Obs Minimum Maximum

Sum

Mean

StdDev

Q1

44 11.3000002 38.2999992 1285.90 29.2250000 8.1957766

Q2

45 38.7000008 47.5000000 1946.80 43.2622218 2.7195662

Q3

44 47.5999985 55.5000000 2264.00 51.4545453 2.1556258

Q4

45 56.0000000 83.3000031 2967.10 65.9355556 8.2075949

TheFEMEMPLOYEDvariableispercentageofwomenemployedinrelationwithmenemployedduring2007.Thetablebelowshowsthatin
161countriesalmost83%(Meancolumn)oftheage15+yearoldpopulationarefemaleemployedwithineachofthesecountries.Ontheother
hand,justineachoftherest14countriesfemaleemployedarelessthanmenemployed,withanaverageofaround42%oneachofthe14
countries.

rson

oyrate
OYED

AnalysisVariable:FEMEMPLOYEDpercentageofwomenemployedinthetotalemployed
population,Age15+
percentageemployedfemaleoveror
underemployedmale,Age15+
OVER
UNDER

N
Obs Minimum Maximum

Sum

Mean

StdDev

161

0.5116822

0.9987212 133.1117107 0.8267808 0.1098859

14

0.3315508

0.5000000

Label

5.8831747 0.4202268 0.0652251

Minimum Maximum

GROSSDOMESTICPRODUCTPERCAPITA(GDPPC),YEAR2010
OILCONSUMPTIONPERCAPITA(OILPC),YEAR2010
TOTALEMPLOYEESAGE15+(EMPLR),YEAR2007
FEMALEEMPLOYEESAGE15+(FEMEMPLR),YEAR2007
PERCENTAGEOFWOMENEMPLOYEDINTHETOTALEMPLOYED
POPULATION,AGE15+

Sum

Mean

103.7758572 105147.44 1660783.55


8740.97
0.0322815 12.2286450 93.4973653 1.4840852
32.0000000 83.1999969
10437.20 58.6359551
11.3000002 83.3000031
8463.80 47.5494381
0.3315508 0.9987212 138.9948854 0.7942565

TheUNIVARIATEProcedure
Variable:incomeperperson(GROSSDOMESTICPRODUCTPERCAPITA(GDPPC),YEAR2010)

Thursday,December22,201606:21:58PM21

TheCONTENTSProcedure
Moments
N

190 SumWeights

190

Mean

8740.96608 SumObservations

1660783.55

StdDeviation

14262.8091 Variance

203427723

Skewness

3.25047792 Kurtosis

14.6656757

UncorrectedSS 5.29647E10 CorrectedSS

3.84478E10

CoeffVariation 163.171999 StdErrorMean

1034.73292

ThetableshowsaStandarddeviation(SD)of14263,whichisveryhighcomparewithitsmeanof8740.ItcanbeusedtheCoefficientof
variation(CV)todetermineitsdispersioninrelationwiththedispersionofothervariables.TheCVforthevariableGDPPCisSDdividedby
themean,soitis1.63.

BasicStatisticalMeasures
Location
Mean

Variability

8740.966 StdDeviation

14263

Median 2553.496 Variance


Mode

203427723

. Range

105044

InterquartileRange

8681

TestsforLocation:Mu0=0
Test

Statistic

Student'st

Sign

pValue

8.447558 Pr>|t|

SignedRank S

95 Pr>=|M| <.0001
9072.5 Pr>=|S|

Quantiles(Definition5)
Level
100%Max

<.0001

Quantile
105147.438

99%

81647.100

95%

33945.314

90%

26901.858

75%Q3

9425.326

50%Median

2553.496

25%Q1

744.239

10%

337.318

<.0001

TheCONTENTSProcedure

Thursday,December22,201606:21:58PM22

Quantiles(Definition5)
Level

Quantile

5%

242.678

1%

115.306

0%Min

103.776

ExtremeObservations
Lowest

Highest

Value Obs

Value Obs

103.776

42

39972.4

145

115.306

30

52301.6

112

131.796

59

62682.1

21

155.033

108

81647.1

110

80 105147.4

128

161.317

MissingValues
PercentOf
Missing
Missing
Value Count AllObs
Obs
.

23

10.80

100.00

ThefollowinggraphshowsthatthedistributionoftheINCOMEPERPERSONidskewedtotheright,reinforcedthetablesabovewhereitis
noticethatthefirstquartile(Q1)concentratedmostoftheincomepercapitavalues.ThedistributionofOILCOMSUMPTIONfollowsthesame
pattern.

TheUNIVARIATEProcedure

Thursday,December22,201606:21:59PM23

The UNIVARIATE Procedure


Variable: oilperperson (OIL CONSUMPTION PER CAPITA (OILPC), YEAR 2010)

Moments
N

63 SumWeights

63

Mean

1.48408516 SumObservations 93.4973653

StdDeviation

1.82509005 Variance

3.3309537

Skewness

3.86540656 Kurtosis

19.603589

UncorrectedSS 345.277182 CorrectedSS


CoeffVariation 122.977448 StdErrorMean

206.51913
0.22993973

ThetableshowsaStandarddeviation(SD)of1.825,whichisveryhighcomparewithitsmeanof1.484.ItcanbeusedtheCoefficientof
variation(CV)todetermineitsdispersioninrelationwiththedispersionofothervariables.TheCVforthevariableGDPPCisSDdividedby
themean,soitis1.23

Thursday,December22,201606:21:59PM24

TheUNIVARIATEProcedure
BasicStatisticalMeasures
Location
Mean

Variability

1.484085 StdDeviation

1.82509

Median 1.032470 Variance


Mode

3.33095

. Range

12.19636

InterquartileRange

1.12396

TestsforLocation:Mu0=0
Test

Statistic

Student'st

Sign

pValue

6.454235 Pr>|t|

<.0001

31.5 Pr>=|M| <.0001

SignedRank S

1008 Pr>=|S|

Quantiles(Definition5)
Level

Quantile

100%Max

12.2286450

99%

12.2286450

95%

4.2999645

90%

2.9976546

75%Q3

1.6286149

50%Median

1.0324699

25%Q1

0.5046595

10%

0.2484670

5%

0.1404663

1%

0.0322815

0%Min

0.0322815

ExtremeObservations
Lowest

Highest

Value Obs

Value Obs

0.0322815

15

4.20743

157

0.1180921

147

4.29996

202

0.1269788

87

4.57227

168

<.0001

TheUNIVARIATEProcedure
ExtremeObservations
Lowest

Highest

Value Obs

Value Obs

0.1404663

153

6.46757

102

0.1821845

206 12.22864

174

MissingValues
PercentOf
Missing
Missing
Value Count AllObs
Obs
.

150

70.42

100.00

Thursday,December22,201606:21:59PM25

TheUNIVARIATEProcedure

Thursday,December22,201606:21:59PM26

The UNIVARIATE Procedure


Variable: employrate (TOTAL EMPLOYEES AGE 15+ (EMPLR), YEAR 2007)

Moments
N

178 SumWeights

178

Mean

58.6359551 SumObservations

StdDeviation

10.5194545 Variance

110.658922

Skewness

0.13984206 Kurtosis

0.1660433

UncorrectedSS

631581.82 CorrectedSS

CoeffVariation 17.9402799 StdErrorMean

10437.2

19586.6292
0.78846645

ThetableshowsaStandarddeviation(SD)of10.52,whichishighcomparewithitsmeanof58.64.ItcanbeusedtheCoefficientofvariation
(CV)todetermineitsdispersioninrelationwiththedispersionofothervariables.TheCVforthevariableGDPPCisSDdividedbythemean,
soitis0.18

Thursday,December22,201606:21:59PM27

TheUNIVARIATEProcedure
BasicStatisticalMeasures
Location
Mean

Variability

58.63596 StdDeviation

10.51945

Median 58.70000 Variance


Mode

110.65892

47.30000 Range

51.20000

InterquartileRange

13.80000

Note:Themodedisplayedisthesmallestof7modeswithacountof3.

TestsforLocation:Mu0=0
Test

Statistic

Student'st

Sign

pValue

74.36709 Pr>|t|

SignedRank S

89 Pr>=|M| <.0001
7965.5 Pr>=|S|

Quantiles(Definition5)
Level

<.0001

Quantile

100%Max

83.2

99%

83.2

95%

78.2

90%

73.1

75%Q3

65.0

50%Median

58.7

25%Q1

51.2

10%

44.7

5%

41.6

1%

34.9

0%Min

32.0

<.0001

TheUNIVARIATEProcedure
ExtremeObservations
Lowest

Highest

Value Obs Value Obs


32.0

210

81.3

29

34.9

114

81.5

79

37.4

90

83.0

115

38.9

96

83.2

30

39.0

211

83.2

200

MissingValues
PercentOf
Missing
Missing
Value Count AllObs
Obs
.

35

16.43

100.00

Thursday,December22,201606:21:59PM28

TheUNIVARIATEProcedure

Thursday,December22,201606:21:59PM29

The UNIVARIATE Procedure


Variable: femaleemployrate (FEMALE EMPLOYEES AGE 15+ (FEMEMPLR), YEAR 2007

Moments
N

178 SumWeights

178

Mean

47.5494381 SumObservations 8463.79998

StdDeviation

14.6257425 Variance

213.912344

Skewness

0.01863437 Kurtosis

0.14664236

UncorrectedSS 440311.418 CorrectedSS

37862.4849

CoeffVariation 30.7590228 StdErrorMean

1.09624575

ThetableshowsaStandarddeviation(SD)of14.63,whichishighcomparewithitsmeanof47.55.ItcanbeusedtheCoefficientofvariation
(CV)todetermineitsdispersioninrelationwiththedispersionofothervariables.TheCVforthevariableGDPPCisSDdividedbythemean,
soitis0.31

Thursday,December22,201606:21:59PM30

TheUNIVARIATEProcedure
BasicStatisticalMeasures
Location
Mean

Variability

47.54944 StdDeviation

14.62574

Median 47.55000 Variance


Mode

213.91234

42.10000 Range

72.00000

InterquartileRange

17.30000

TestsforLocation:Mu0=0
Test

Statistic

Student'st

Sign

pValue

43.3748 Pr>|t|

<.0001

89 Pr>=|M| <.0001

SignedRank S

7965.5 Pr>=|S|

Quantiles(Definition5)
Level

Quantile

100%Max

83.30

99%

82.20

95%

75.80

90%

66.60

75%Q3

56.00

50%Median

47.55

25%Q1

38.70

10%

27.90

5%

21.40

1%

12.40

0%Min

11.30

ExtremeObservations
Lowest

Highest

Value Obs Value Obs


11.3

210

79.2

115

12.4

90

80.0

200

13.0

96

80.5

161

<.0001

TheUNIVARIATEProcedure
ExtremeObservations
Lowest

Highest

Value Obs Value Obs


16.7

187

82.2

132

17.7

168

83.3

30

MissingValues
PercentOf
Missing
Missing
Value Count AllObs
Obs
.

35

16.43

100.00

Thursday,December22,201606:21:59PM31

TheUNIVARIATEProcedure

Thursday,December22,201606:21:59PM32

The UNIVARIATE Procedure


Variable: FEMEMPLOYED (PERCENTAGE OF WOMEN EMPLOYED IN THE TOTAL EMPLOYED
POPULATION, AGE 15+)

Moments
N

175 SumWeights

175

Mean

0.79425649 SumObservations 138.994885

StdDeviation

0.15380568 Variance

0.02365619

Skewness

1.2814384 Kurtosis

1.03938886

UncorrectedSS 114.513766 CorrectedSS

4.11617641

CoeffVariation 19.3647366 StdErrorMean

0.01162662

ThetableshowsaStandarddeviation(SD)of0.154,whichishighcomparewithitsmeanof0.79.ItcanbeusedtheCoefficientofvariation
(CV)todetermineitsdispersioninrelationwiththedispersionofothervariables.TheCVforthevariableGDPPCisSDdividedbythemean,
soitis0.19.Comparewithothervariables,itisclearthatthevariablewiththemajordispersionincomparisonwithotheristheonewiththe
highestCV,whichistheGROSSDOMESTICPRODUCTPERCAPITAvariable.

Thursday,December22,201606:21:59PM33

TheUNIVARIATEProcedure

BasicStatisticalMeasures
Location
Mean

Variability

0.794256 StdDeviation

0.15381

Median 0.850000 Variance

0.02366

Mode

0.66717

. Range

InterquartileRange 0.18178

TestsforLocation:Mu0=0
Test

Statistic

Student'st

Sign

pValue

68.31364 Pr>|t|

SignedRank S

87.5 Pr>=|M| <.0001


7700 Pr>=|S|

Quantiles(Definition5)
Level

<.0001

Quantile

100%Max

0.998721

99%

0.993730

95%

0.962264

90%

0.934102

75%Q3

0.898876

50%Median

0.850000

25%Q1

0.717092

10%

0.521053

5%

0.459605

1%

0.334190

0%Min

0.331551

<.0001

TheUNIVARIATEProcedure
ExtremeObservations
Lowest

Highest

Value Obs

Value Obs

0.331551

90 0.978693

150

0.334190

96 0.984199

127

0.345703

168 0.984639

71

0.353125

210 0.993730

173

0.372549

147 0.998721

104

MissingValues
PercentOf
Missing
Missing
Value Count AllObs
Obs
.

38

17.84

100.00

Thursday,December22,201606:21:59PM34

TheUNIVARIATEProcedure

Thursday,December22,201606:21:59PM35

LIBNAME mydata "/courses/d1406ae5ba27fe300 " access=readonly;


DATA NEW;
SET MYDATA.GAPMINDER;
keep COUNTRY INCOMEPERPERSON QUARTILE_GDPPC OILPERPERSON QUARTILE_OILPC EMPLOYRATE
QUARTILE_EMPLR FEMALEEMPLOYRATE QUARTILE_FEMEMPLR FEMEMPLOYED FEMEMPLOYED_DIFF;
LABEL INCOMEPERPERSON = "GROSS DOMESTIC PRODUCT PER CAPITA (GDPPC), YEAR 2010"
OILPERPERSON = "OIL CONSUMPTION PER CAPITA (OILPC), YEAR 2010"
EMPLOYRATE = "TOTAL EMPLOYEES AGE 15+ (EMPLR), YEAR 2007"
FEMALEEMPLOYRATE = "FEMALE EMPLOYEES AGE 15+ (FEMEMPLR), YEAR 2007"
FEMEMPLOYED = "PERCENTAGE OF WOMEN EMPLOYED IN THE TOTAL EMPLOYED POPULATION,
AGE 15+"
COUNTRY = "COUNTRIES AROUND THE WORLD"
QUARTILE_GDPPC = "quartiles of Gross Domestic Product per capita"
QUARTILE_OILPC = "quartiles of Oil Consumption per Capita"
QUARTILE_EMPLR = "quartiles of Total Employees Age 15+"
QUARTILE_FEMEMPLR = "quartiles of Female Employees Age 15+"
FEMEMPLOYED_DIFF = "percentage employed female over or under employed male,
Age 15+";
/*The variable GDPPC has been divided into 4 groups (Quartiles). Missing values no
considered (23 total).
Therefore, 190 values considered out of 213*/
IF
0 <= INCOMEPERPERSON < 744.239 THEN QUARTILE_GDPPC = 'Q1';

TheUNIVARIATEProcedure

Thursday,December22,201606:21:59PM36

IF 744.239 <= INCOMEPERPERSON < 2553.496 THEN QUARTILE_GDPPC = 'Q2';


IF 2553.496 <= INCOMEPERPERSON < 9425.326 THEN QUARTILE_GDPPC = 'Q3';
IF
INCOMEPERPERSON >= 9425.326 THEN QUARTILE_GDPPC = 'Q4';
/*The variable OILPC has been divided into 4 groups (Quartiles). Missing values no
considered (150 total).
Therefore, 63 values considered out of 213*/
IF
0 <= OILPERPERSON < 0.5046595 THEN QUARTILE_OILPC = 'Q1';
IF 0.5046595 <= OILPERPERSON < 1.0324699 THEN QUARTILE_OILPC = 'Q2';
IF 1.0324699 <= OILPERPERSON < 1.6286149 THEN QUARTILE_OILPC = 'Q3';
IF
OILPERPERSON >= 1.6286149 THEN QUARTILE_OILPC = 'Q4';
/*The variable EMPLR has been divided into 4 groups (Quartiles). Missing values no
considered (35 total).
Therefore, 178 values considered out of 213*/
IF
0 <= EMPLOYRATE < 51.2 THEN QUARTILE_EMPLR = 'Q1';
IF 51.2 <= EMPLOYRATE < 58.7 THEN QUARTILE_EMPLR = 'Q2';
IF 58.7 <= EMPLOYRATE < 65.0 THEN QUARTILE_EMPLR = 'Q3';
IF
EMPLOYRATE >= 65.0 THEN QUARTILE_EMPLR = 'Q4';
/*The variable FEMALEEMPLOYRATE has been divided into 4 groups (Quartiles). Missing values
no considered (35 total).
Therefore, 178 values considered out of 213*/
IF
0 <= FEMALEEMPLOYRATE < 38.7 THEN QUARTILE_FEMEMPLR = 'Q1';
IF 38.7 <= FEMALEEMPLOYRATE < 47.55 THEN QUARTILE_FEMEMPLR = 'Q2';
IF 47.55 <= FEMALEEMPLOYRATE < 56 THEN QUARTILE_FEMEMPLR = 'Q3';
IF
FEMALEEMPLOYRATE >= 56 THEN QUARTILE_FEMEMPLR = 'Q4';
/*secondary variables estimating the PERCENTAGE OF FEMALE EMPLOYED IN THE TOTAL EMPLOYED
POPULATION, AGE 15 +, during 2007*/
FEMEMPLOYED = FEMALEEMPLOYRATE/EMPLOYRATE;
/*Since FEMEMPLOYED includes 3 values more or equal to 1, they will set to as missing
values */
IF FEMEMPLOYED >= 1 THEN FEMEMPLOYED = .;
/*The variable FEMEPLOYED has been divided into 2 groups (50 percent each). Missing values
no considered (35 + 3 = 38 total).
Therefore, 175 values considered out of 213*/
IF
0 <= FEMEMPLOYED < 0.5 THEN FEMEMPLOYED_DIFF = 'UNDER';
IF
FEMEMPLOYED >= 0.5 THEN FEMEMPLOYED_DIFF = 'OVER';
/* PROC PRINT;
VAR COUNTRY INCOMEPERPERSON QUARTILE_GDPPC OILPERPERSON QUARTILE_OILPC EMPLOYRATE
FEMALEEMPLOYRATE FEMEMPLOYED; */

proc contents data=new; run;


PROC FREQ DATA=NEW;
TABLES QUARTILE_GDPPC;

TheUNIVARIATEProcedure

Thursday,December22,201606:21:59PM37

PROC FREQ DATA=NEW;


TABLES QUARTILE_OILPC;
PROC FREQ DATA=NEW;
TABLES QUARTILE_EMPLR;
PROC FREQ DATA=NEW;
TABLES QUARTILE_FEMEMPLR;
PROC FREQ DATA=NEW;
TABLES FEMEMPLOYED_DIFF;
PROC GPLOT; PLOT OILPERPERSON*INCOMEPERPERSON;
PROC GCHART; VBAR QUARTILE_GDPPC/discrete TYPE=mean SUMVAR=OILPERPERSON;
PROC GPLOT; PLOT EMPLOYRATE*INCOMEPERPERSON;
PROC GCHART; VBAR QUARTILE_GDPPC/discrete TYPE=mean SUMVAR=EMPLOYRATE;
PROC GPLOT; PLOT FEMALEEMPLOYRATE*INCOMEPERPERSON;
PROC GCHART; VBAR QUARTILE_GDPPC/discrete TYPE=mean SUMVAR=FEMALEEMPLOYRATE;
PROC GPLOT; PLOT OILPERPERSON*FEMALEEMPLOYRATE;
PROC GCHART; VBAR QUARTILE_FEMEMPLR/discrete TYPE=mean SUMVAR=OILPERPERSON;
PROC MEANS DATA=NEW MIN MAX SUM MEAN STD;
CLASS QUARTILE_GDPPC;
VAR INCOMEPERPERSON;
PROC MEANS DATA=NEW MIN MAX SUM MEAN STD;
CLASS QUARTILE_OILPC;
VAR OILPERPERSON;
PROC MEANS DATA=NEW MIN MAX SUM MEAN STD;
CLASS QUARTILE_EMPLR;
VAR EMPLOYRATE;
PROC MEANS DATA=NEW MIN MAX SUM MEAN STD;
CLASS QUARTILE_FEMEMPLR;
VAR FEMALEEMPLOYRATE;
PROC MEANS DATA=NEW MIN MAX SUM MEAN STD;
CLASS FEMEMPLOYED_DIFF;
VAR FEMEMPLOYED;

PROC MEANS DATA=NEW MIN MAX SUM MEAN STD;


VAR INCOMEPERPERSON OILPERPERSON EMPLOYRATE FEMALEEMPLOYRATE FEMEMPLOYED;
PROC UNIVARIATE;
VAR INCOMEPERPERSON OILPERPERSON EMPLOYRATE FEMALEEMPLOYRATE FEMEMPLOYED;
HISTOGRAM;
RUN;

TheUNIVARIATEProcedure

Thursday,December22,201606:21:59PM38

Das könnte Ihnen auch gefallen