This data set is a sample of Web server statistics for a computer science department. It contains the following 11 sections of data:
Total successful requests
Average successful requests per day
Total successful requests for pages
Average successful requests for pages per day
Total failed requests
Total redirected requests
Number of distinct files requested
Number of distinct hosts served
Corrupt logfile lines
Total data transferred
Average data transferred per day
Write an essay of 2–3 pages that contains the following:
A complete overview of the data, identifying anomalies in different weeks, and the weeks that the data are not regular.
Choose 5 different sections of data, examine these sections, and provide the specific selection process and criteria you used to select this data set.
Provide the measures of tendency and dispersion for each of the 5 different sections of data you selected.
Provide 1 chart or graph for each of the 5 processed sections. This may be a pie or bar chart or a histogram.
Label the chart or graph clearly.
Explain why the graph you provided gave a good visual representation of the data.
Based on your explanation above, identify some specific advantages why, in general, charts and graphs are important in conveying information in a visual format.
Determine the standard deviation and variation, and explain their importance in statistical analysis of a data set.
Based on the tasks you performed in this project, research how statistics are used in information technology (IT), and provide references for your research
Your essay should include proper citation in APA formatting, both in-text, and in reference pages. Include a title page and use 12-point Times New Roman double-spaced font throughout the text
TO BE RE-WRITTEN FROM THE SCRATCH
Get Professionally Written Papers From The Writing Experts
In order to get accurate overview of the data, it was important to start identifying outliers from the first row of the data set. Analysis indicated that week 3 was outlier from the lower site, while week 5 was outlier from the upper site. The findings of week three were summarized in the first five rows of the data set as shown in the table below.
Weeks |
Total Requests |
Average Request Per day |
Total Requests for pages |
Requests for Pages Per day |
Total Failed |
1 |
220506 |
43864 |
56913 |
11321 |
13400 |
2 |
230004 |
46000.8 |
67033 |
13406.6 |
12400 |
3 |
198080 |
39616 |
45002 |
9000.4 |
12334 |
4 |
243058 |
48611.6 |
68500 |
13700 |
15890 |
5 |
332800 |
66560 |
71034 |
14206.8 |
21890 |
6 |
231890 |
46378 |
68456 |
13691.2 |
13450 |
7 |
235450 |
47090 |
75344 |
15068.8 |
13789 |
8 |
245455 |
49091 |
61789 |
12357.8 |
18332 |
9 |
228944 |
45788.8 |
59845 |
11969 |
13450 |
Contrary to this observation, week 5 recorded the highest number of request in the first two rows.In addition, week 5 was not regular as compared to week 3. For instance, the data set in the first two rows recorded the highest value in week 5 more than any other week. However, the pattern was not regular or consistent because the………………………………………………………………………………………………………………………………………………………………………………………………………………………………….
……………………………………………………………………………..Web server statistics …………………………………………………………………………………………………………………………………………………..