
new drinking laws in the UK. The article from "The Sun" is titled "Sup all night". The article from "Daily Mail" is titled "Police braced for the great British binge" (see references for more detail).
The research consists of the following steps. First, I select a sample of 100 words from each article. I count the length of each word and the frequency of words of the same length, put the results into a summary table and analyze the findings. Then I repeat the procedure for 200 words and 400 words. I split my analysis into these 3 consecutive steps in order to see any possible changes in my statistical indicators (mean, median and mode). They should not vary drastically for each article when moving to a larger sample. But they should become more accurate, because random differences should smooth out in larger samples.
As noted above, for each sample size I calculate the mean, median and mode. The mean shows me the average word length in the sample; it is obtained by dividing the total number of letters in the sample by the total number of words. So it can be any decimal number, like 4.53. It doesn't tell me the exact number of letters in a word (as there are no words with 4.53 letters), but it gives a good estimate of how letters are distributed across the words.
However, the mean can give somewhat misleading results if the data distribution is skewed to the left or right. In that case the outliers carry too much weight in the mean, distorting the real picture. To deal with this I also calculate the median, which shows the length of the word that is in the middle of the sample. If I arrange all 100 words from the sample in order of length, from 1-letter words to 14-letter words, the median will be the length of the word in the centre.
The mode simply shows the word length that occurs most frequently in the sample.
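The sampling steps described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the text is a made-up placeholder rather than either newspaper article, the function name `length_frequencies` is hypothetical, and word length is taken as the raw length of each whitespace-separated token.

```python
from collections import Counter


def length_frequencies(words, sample_size):
    """Count how many of the first `sample_size` words have each length."""
    sample = words[:sample_size]
    return Counter(len(word) for word in sample)


# Hypothetical placeholder text (450 words), not taken from either article.
text = "the quick brown fox jumps over the lazy dog " * 50
words = text.split()

# Repeat the same count for the three consecutive sample sizes.
for size in (100, 200, 400):
    table = length_frequencies(words, size)
    print(size, sorted(table.items()))
```

Each printed line is one summary table: pairs of (word length, number of words with that length) for the given sample size.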
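As a small worked example of the three indicators, the sketch below uses Python's standard `statistics` module on a made-up list of ten word lengths (the numbers are hypothetical, not data from the articles). One 14-letter outlier is included to show how it pulls the mean above the median.

```python
import statistics

# Hypothetical word lengths for a ten-word sample, with one long outlier.
lengths = [1, 2, 3, 3, 3, 4, 4, 5, 7, 14]

# Mean: total number of letters (46) divided by the number of words (10).
mean = statistics.mean(lengths)

# Median: the middle value of the sorted lengths. With an even number of
# words it is the average of the two middle values (here 3 and 4).
median = statistics.median(lengths)

# Mode: the word length that occurs most often (here 3, three times).
mode = statistics.mode(lengths)

print(mean, median, mode)  # 4.6 3.5 3
```

Note that with an even sample size, such as 100 words, there is no single word in the centre, so the median is conventionally the average of the two central lengths.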
Assumptions
I made the following assumptions when selecting the samples and calculating the word lengths:
1) I do not count punctuation marks (i.e. commas, periods, question marks, quotes, etc. are not taken into account).
2) A word with a hyphen is counted as if it is a word without a hyphen (e.g. shake-up is regarded as a seven-letter word).
3) The apostrophe in a word is not counted (e.g. labour's is regarded as a seven-letter word).
4) When I encounter a number, a date or a time, I take it into account. The number of symbols in it becomes the word length (e.g. 10500 is regarded as a five-letter word, 11pm is regarded as a four-letter word).
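The four assumptions above amount to counting only letters and digits in each word. A minimal sketch of that rule follows; the function names `word_length` and `word_lengths` are hypothetical, and splitting the text on whitespace is an additional assumption not stated in the essay.

```python
def word_length(token):
    """Count only letters and digits, so punctuation, hyphens and
    apostrophes do not add to the length."""
    return sum(1 for ch in token if ch.isalnum())


def word_lengths(text):
    """Return the length of every whitespace-separated token in `text`,
    dropping tokens that contain no letters or digits at all."""
    lengths = [word_length(token) for token in text.split()]
    return [n for n in lengths if n > 0]


# The examples given in the assumptions:
print(word_length("shake-up"))   # 7
print(word_length("labour's"))   # 7
print(word_length("10500"))      # 5
print(word_length("11pm"))       # 4
```

A stand-alone dash or quotation mark yields a length of zero and is therefore not counted as a word.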

Analysis
1) Based on the sample of 100 words from each article I obtained the following data:

I can conclude that there are some differences here. The mean, median and mode are all slightly higher for "Daily Mail" than for "The Sun". However, this difference is certainly not significant. In each article there is a similar tendency towards a high frequency of short words. Words of up to 5 letters (including 5) account for 75% of all words for "The Sun" and for 61% of all words for "Daily Mail". So based on this sample we can argue that the author of the "Daily Mail" article on average uses longer words than the author of the article in "The Sun".
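The share of short words quoted above is a cumulative frequency. A sketch of how such a percentage could be computed is given below; the function name `share_up_to` and the list of lengths are hypothetical and do not reproduce the article data.

```python
from collections import Counter


def share_up_to(lengths, max_length):
    """Percentage of words whose length is at most `max_length`."""
    counts = Counter(lengths)
    short = sum(n for length, n in counts.items() if length <= max_length)
    return 100 * short / len(lengths)


# Hypothetical word lengths for a ten-word sample.
lengths = [2, 3, 3, 4, 4, 5, 6, 7, 8, 10]

# Six of the ten words have 5 letters or fewer.
print(share_up_to(lengths, 5))  # 60.0
```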
The histograms below show the distribution of frequencies for each article.

Words with 3 and 4 letters each account for more than 20% of all the words in the article.

No category accounts for more than 20% of all the words (unlike in "The Sun"). Each of the 2-, 3-, 4-, 5- and 7-letter word categories has a share of more than 12% of the total ...

## Summary

The purpose of this paper is to analyze the word length in articles from two different newspapers using statistics. My hypothesis is that the average word length in both articles is approximately the same. My research should show whether this hypothesis is true or false.
Author : miguelconroy
