UNDER CONSTRUCTION

DATA SETS

 

Biting Flies (Johnson and Wichern, 1998)

In this data set morphological characteristics of two species of biting flies are collected. The two species are Leptoconops Carteri and Leptoconops Torrens (genus leptoconops). They are so morphologically similar that for many years they have been considered as a unique species.

On 35 units of each species, a multivariate variable composed of 7 dependent taxonomic responses is observed.

Each univariate component is described in the following table:

Variable

Description

Type

 

 

 

WL

wing length (mm)

numeric

 

 

WW

wing width (mm)

numeric

 

 

TPL

third palp length (mm)

numeric

 

 

TPW

third palp width  (mm)

numeric

 

 

FPL

fourth palp length  (mm)

numeric

 

 

LAS

length of antennal

numeric

 

segment (mm)

LAS13

length of antennal

numeric

 

segment 13 (mm)

 

The data set is organized in multiple records including 8 variables; 7 of which are the taxonomic responses and a further variable indicates the species as follows:

Variable

Description

Type

Group

0=Leptoconops Carteri

binary

1=Leptoconops Torrens

WL

WW

TPL

TPW

FPL

LAS

LAS13

Group

85

41

31

13

25

9

8

0

87

38

32

14

22

13

13

0

94

44

36

15

27

8

9

0

92

43

32

17

28

9

9

0

96

43

35

14

26

10

10

0

91

44

36

12

24

9

9

0

90

42

36

16

26

9

9

0

92

43

36

17

26

9

9

0

91

41

36

14

23

9

9

0

87

38

35

11

24

9

10

0

97

45

39

17

27

9

10

0

89

38

36

13

22

9

9

0

94

45

37

13

26

9

9

0

96

44

37

14

24

9

10

0

104

49

35

14

21

10

10

0

94

41

31

17

26

10

9

0

99

44

31

18

28

10

9

0

94

38

32

13

22

9

9

0

94

43

37

16

26

9

10

0

93

43

38

14

28

10

10

0

95

44

37

18

27

10

10

0

95

45

39

13

27

10

10

0

96

39

37

12

26

8

8

0

103

46

34

18

26

10

10

0

108

44

37

14

25

11

11

0

106

47

38

15

26

10

10

0

105

46

34

14

31

10

11

0

103

44

34

15

23

10

10

0

100

41

35

14

24

10

10

0

109

44

36

13

27

11

10

0

104

45

36

15

30

10

10

0

95

40

35

14

23

9

10

0

104

44

34

15

29

9

10

0

90

40

37

12

22

9

10

0

104

46

37

14

30

10

10

0

86

19

37

11

25

9

9

1

94

40

38

14

31

6

7

1

103

48

39

14

33

10

10

1

82

41

35

12

25

9

8

1

103

43

42

15

32

9

9

1

101

43

40

15

25

9

9

1

103

45

44

14

29

11

11

1

100

43

40

18

31

11

10

1

99

41

42

15

31

10

10

1

100

44

43

16

34

10

10

1

112

47

44

16

38

12

11

1

99

48

37

14

32

10

9

1

98

45

41

19

31

9

8

1

101

46

42

14

24

11

10

1

99

45

37

13

28

10

9

1

103

47

44

15

20

8

9

1

98

40

38

12

32

9

8

1

101

46

36

14

28

10

10

1

101

46

40

17

32

9

9

1

98

47

39

15

33

10

10

1

99

45

42

15

32

10

9

1

102

45

44

15

30

10

10

1

97

45

37

15

32

10

9

1

96

39

40

14

20

9

9

1

89

39

33

12

20

9

8

1

99

42

38

14

33

9

9

1

110

45

41

17

36

9

10

1

99

44

35

16

31

10

10

1

103

43

38

14

32

10

10

1

95

46

36

15

31

8

8

1

101

47

38

14

37

11

11

1

103

47

40

15

32

11

11

1

99

43

37

14

23

11

10

1

105

50

40

16

33

12

11

1

99

47

39

14

34

7

7

1

Back


Tumor growth (Barzi, et al., 1999)

This data set is related to an experiment whose purpose was to investigate the anticancer activity of a new taxol like drug . The animal model used are nude mice with an ovarian tumoral mass with a 4-8 mm diameter.

Experimental units have been randomised in four groups differing for the dose of the drug. A control group (1st) has been treated with a physiological solution and three groups (2nd, 3rd, 4th) have been treated with an increasing dose of the drug.

In order to avoid the possible toxic effect of the drug, treatments have been administered a total of 3 times, at intervals of 4 days.

Volumes of tumoral masses are measured before the beginning of the treatment (baseline volume) and after the beginning at intervals of 4 days for a total of 18 times.

Each time the mass is observed the response variable is the relative volume of the tumoral mass with respect to the baseline volume.

Back

Anxiety Reduction Treatment (Landenna et al., 1998)

This data set collects paired data on a training treatment for the reduction of anxiety in a sample of 20 subjects. The latter are presumed to be homogeneous with respect to the most common confounding factors, e.g. sex, age, heath status etc.

Anxiety is measured according to IPAT (Institute for Personality and Ability Testing) in terms of quantitative score. The latter is calculated by the sum of the responses to a set of different items of a standardized psychological test.

Each unit is observed before the training treatment (baseline observation) and one week after a fixed number of training sessions. From these two dependent variables a further dependent variable, indicating the observed individual difference, is calculated.

Variable

Description

Type

 

 

 

YA

baseline

numeric

 

observation

YB

post training

numeric

 

observation

X

YA-YB

numeric

i

YA

YB

X

1

19

14

5

2

22

23

-1

3

18

13

5

4

18

17

1

5

24

20

4

6

30

22

8

7

26

30

-4

8

28

21

7

9

15

11

4

10

30

29

1

11

16

17

-1

12

25

20

5

13

22

18

4

14

19

17

2

15

27

22

5

16

23

21

2

17

24

21

3

18

18

15

3

19

28

24

4

20

27

22

5

Back


Job satisfaction (Pallini and Pesarin, 1992)

This data set is referred to a psychological experiment for comparing the perceived degree of job satisfaction in 20 workers. They were classified by the dichotomous attribute "extroverted / introverted" in two groups respectively of 12 and 8 units. The latter are presumed to be homogeneous with respect to the most common confounding factors, e.g. sex, age, heath status, social status, etc.

The response variable consists of the measure of the perceived degree of job satisfaction in terms of a quantitative score calculated by the sum of sub-responses, each related to a particular sub-aspect.

The data set is organized in multiple records including 2 variables: the response variable and a further variable indicating the group:

Variable

Description

Type

 

 

 

X

perceived degree

numeric

of job satisfaction

Group

0=extroverted

binary

 

1=introverted

 

X

66

57

81

62

61

60

73

59

80

55

67

70

64

58

45

43

37

56

44

42

Group

0

0

0

0

0

0

0

0

0

0

0

0

1

1

1

1

1

1

1

1

 

 

 

Back

Length of worms (Pollard, 1977)

This data set refers to a one-way ANOVA testing problem for the equality in distribution of the length of worms in 3 distinct groups.

The data set is organized in multiple records including 2 variables: the response variable and a further variable indicating the group:

Variable

Description

Type

 

 

 

X

worm's length

numeric

(mm)

Group

1,2,3

numeric

X

10.2

8.2

8.9

8

8.3

8

12.2

10.6

9.9

13

8.1

10.8

11.5

9.2

10.5

9.2

8.7

9

Group

1

1

1

1

1

1

2

2

2

2

2

2

3

3

3

3

3

3

 

Back