Bland klippbergens indianer by George Alfred Henty: Difficulty Assessment for Swedish Learners

How difficult is Bland klippbergens indianer for Swedish learners? We ran multiple tests on its full text (freely available here) of approximately 99,790 words, crunched the numbers for you, and present the results below.

Difficulty Assessment Summary

We have estimated Bland klippbergens indianer to have a difficulty score of 53. Here are its scores:

Measure	Difficulty (easy to difficult)	Score (1 - 100)
Overall Difficulty	53%	53
Vocabulary Difficulty	56%	56
Grammatical Difficulty	50%	50

Vocabulary Difficulty: Breakdown

Vocabulary difficulty: 56%

This score has been calculated based on frequency vocabulary (the top most frequently used words in Swedish). It combines various measures of Bland klippbergens indianer's text analyzed in terms of frequency vocabulary: a plain vocabulary score, frequency-weighted vocabulary score, banded frequency vocabulary scores based on vocabulary of the text falling in the top 1,000 or 2,000 most frequent words, etc. Here's a further breakdown of how often the top most frequently used words in Swedish appear in the full text of Bland klippbergens indianer:

Vocabulary difficulty breakdown for Bland klippbergens indianer: a test for Swedish top frequency vocabulary
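
As a rough illustration of a banded frequency score, the sketch below measures what share of a text's tokens fall within the top N entries of a frequency list. The function name `band_coverage`, the toy frequency list and the lower-casing step are all assumptions made for this example; the scoring actually used on this page combines several such measures in a way that is not specified here.

```python
def band_coverage(tokens, ranked_words, band_size):
    """Share of tokens that belong to the `band_size` most frequent words.

    `ranked_words` is assumed to be a frequency list, most frequent first.
    """
    if not tokens:
        return 0.0
    band = set(ranked_words[:band_size])
    hits = sum(1 for token in tokens if token.lower() in band)
    return hits / len(tokens)


# Toy data only: a hypothetical five-word frequency list and a short token list.
ranked = ["och", "att", "de", "en", "det"]
tokens = ["Och", "de", "hade", "en", "häst"]

top3 = band_coverage(tokens, ranked, band_size=3)
top5 = band_coverage(tokens, ranked, band_size=5)
print(f"top 3: {top3:.0%}, top 5: {top5:.0%}")
```

With a real frequency list, the same function could be called with band sizes of 1,000 and 2,000.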

We have also calculated the following approximate data on the vocabulary in Bland klippbergens indianer:

Measure	Score
Number of words	99,790
Number of unique words	9,502
Number of recognized words for names/places/other entities	3,799
Number of very rare non-entity words	3,036
Number of sentences	12,753
Average number of words/sentence	8

There is some research suggesting that you need to know about 98% of a text's vocabulary in order to be able to infer the meaning of unknown words when reading. If true, this means that you would need to know around 9,311 words (where all the forms of a word are still counted as unique words) in Swedish to be able to read Bland klippbergens indianer without a dictionary and fully understand it.
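
The 9,311 figure can be reproduced with simple arithmetic. The sketch below assumes the 98% threshold is applied to the count of unique words and that the result is rounded down; both are our reading of the numbers above rather than a documented method.

```python
import math

unique_words = 9_502   # number of unique words reported above
coverage = 0.98        # coverage threshold suggested by the research

# 9,502 * 0.98 = 9,311.96; rounding down matches the figure quoted in the text.
words_needed = math.floor(unique_words * coverage)
print(words_needed)
```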

Grammatical Difficulty: Breakdown

Grammatical difficulty: 50%

Here is a further grammatical breakdown of this text. You can find an explanation of all these scores below.

Measure	Score
Automated Readability Index	3
Coleman-Liau Index	6
Type/Token Ratio (TTR)	0.09522
Root type/Token Ratio (RTTR)	0.000000954203
Corrected type/Token Ratio (CTTR)	0.000000477102
MTLD Index	64
HDD Index	67
Yule's I Index	77
Lexical Diversity Index (MTLD + HD-D + Yule's I)	69

The type-token ratio (TTR) of Bland klippbergens indianer is 0.09522. The TTR is the most basic measure of lexical diversity. To calculate it, we divide the number of unique words by the number of words in the text. For this text, the number of unique words is 9,502, while the number of words is 99,790, so the TTR is 9,502 / 99,790 = 0.09522. However, the TTR is a very crude measure, as it is extremely dependent on text length. The longer the text, the lower the TTR usually is, since common words tend to repeat often. Because this text is far longer than 1,000 words, the TTR is unlikely to give an accurate measure.
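
A minimal sketch of the TTR calculation, first with the counts reported for this text and then on a toy token list to show how repetition alone lowers the ratio. The function name `type_token_ratio` is ours, and tokenization is assumed to have been done already.

```python
def type_token_ratio(tokens):
    """Number of unique tokens divided by the total number of tokens."""
    return len(set(tokens)) / len(tokens)


# Using the counts reported for this text.
ttr_reported = round(9_502 / 99_790, 5)
print(ttr_reported)

# Toy illustration of length dependence: same vocabulary, three times the length.
short = "och de och vi".split()
longer = short * 3
print(type_token_ratio(short), type_token_ratio(longer))
```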

The root type-token ratio (RTTR) and corrected type-token ratio (CTTR) are measures that researchers proposed to partially address the TTR's dependence on text length. As computed on this page, the RTTR divides the number of unique words by the square of the number of words (9,502 / (99,790 × 99,790) = 0.000000954203), while the CTTR divides it by twice that square (9,502 / (2 × 99,790 × 99,790) = 0.000000477102). Note that the definitions usually given in the literature divide by a square root instead: RTTR = unique words / √(words) and CTTR = unique words / √(2 × words), which for this text would give roughly 30.08 and 21.27. However, these measures are not as easily readable, and a growing body of research asserts that the CTTR and RTTR do not effectively address the problem of text length. Therefore, while we do provide the full text's TTR, RTTR and CTTR on this page, these figures do not form part of our final calculations.
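
For comparison, the sketch below computes the RTTR and CTTR in two ways: with the square-based arithmetic quoted above, and with the square-root forms in which these measures are usually defined in the literature (unique words / √words and unique words / √(2 × words)). The variable names are ours.

```python
import math

types, tokens = 9_502, 99_790   # unique words and total words reported above

# Square-based arithmetic, reproducing the figures in the table.
rttr_square = types / tokens**2
cttr_square = types / (2 * tokens**2)

# Square-root forms, as these measures are usually defined.
rttr_root = types / math.sqrt(tokens)
cttr_root = types / math.sqrt(2 * tokens)

print(f"square-based: {rttr_square:.12f}, {cttr_square:.12f}")
print(f"root-based:   {rttr_root:.2f}, {cttr_root:.2f}")
```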

The Automated Readability Index (ARI) is one readability measure that has been developed by researchers over the years. The formula for calculating the ARI is as follows:
ARI = 4.71 × (characters / words) + 0.5 × (words / sentences) - 21.43

The ARI should compute a reading level approximately corresponding to the reader's grade level (assuming the reader undertakes formal education). Thus, for example, a value of 1 is kindergarten level, while a value of 12 or 13 is the last year of school, and 14 is a sophomore at college. The current ARI of this text is 3, making it understandable for third-grade students at their expected level of education.
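
A minimal sketch of an ARI calculation, using the standard published formula ARI = 4.71 × (characters / words) + 0.5 × (words / sentences) - 21.43. The tokenization is an assumption: words are runs of word characters, and sentences are split naively on ".", "!" and "?", which may differ from how the figures on this page were produced.

```python
import re


def automated_readability_index(text):
    """ARI from naive word and sentence counts (assumed tokenization)."""
    words = re.findall(r"\w+", text)
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    characters = sum(len(word) for word in words)
    return (4.71 * characters / len(words)
            + 0.5 * len(words) / len(sentences)
            - 21.43)


# Two short sentences: 7 words, 28 characters.
sample = "Snart brann en klar eld. Hunden sov."
ari = automated_readability_index(sample)
print(round(ari, 2))
```

Very short, simple sentences can yield values below 1, as in this sample.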

The Coleman-Liau Index (CLI) is a similar index designed by Meri Coleman and T. L. Liau, and it is likewise supposed to compute the grade level of the reader (thus, for example, sophomore-level material would be around grade 14, or year 14 of formal education, while kindergarten / primary school level material would be close to grade 1 in the CLI). The CLI is usually slightly higher than the ARI. The CLI is computed with this formula:
CLI = 0.0588 × L - 0.296 × S - 15.8, where L is the average number of letters per 100 words and S is the average number of sentences per 100 words
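
A minimal sketch of a CLI calculation, using the standard published formula CLI = 0.0588 × L - 0.296 × S - 15.8, where L is the average number of letters per 100 words and S the average number of sentences per 100 words. As before, the tokenization (runs of word characters, sentences split on ".", "!" and "?") is an assumption for illustration.

```python
import re


def coleman_liau_index(text):
    """CLI from naive word and sentence counts (assumed tokenization)."""
    words = re.findall(r"\w+", text)
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    letters = sum(len(word) for word in words)
    letters_per_100 = letters / len(words) * 100
    sentences_per_100 = len(sentences) / len(words) * 100
    return 0.0588 * letters_per_100 - 0.296 * sentences_per_100 - 15.8


sample = "Snart brann en klar eld. Hunden sov."
cli = coleman_liau_index(sample)
print(round(cli, 2))
```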

Other indexes exist, such as the Flesch-Kincaid Reading Ease, the Gunning-Fog Score, and others, but we have chosen not to include them: unlike the ARI and CLI, they are based on a syllable count and therefore arguably only work for English and not Swedish.

We compute a further compound lexical diversity index, which should range from 1 to 100 (with a standard deviation of around 10 and an average value of around 50); it is 69 in the present case. The compound lexical diversity index consists of the following indexes, averaged out (and also provided in the table above):

the Measure of Textual Lexical Diversity (MTLD) index - a measure based on computing the TTR for increasingly larger parts of the text until the TTR drops below a certain threshold (around 0.7 in our case), at which point the TTR is reset and a counter is increased; at the end, the number of words in the text is divided by this counter; as a result, the MTLD does not vary significantly with text length;
the Yule's I index (based on Yule's K characteristic inverted) - an index based on the work of the statistician G.U. Yule, who published his index of Frequency Vocabulary in his paper "The statistical study of literary vocabulary"; Yule's I takes into account the number of words in the text, and a compound summed measure of word frequency;
the Hypergeometric Distribution D (HD-D) index (based on vocd) - an index which assesses the contribution of each word to the diversity of the text; to calculate such contributions, a hypergeometric distribution is used to compute the probability of each word appearing in word samples extracted from the text; these probabilities are then divided by the sample size and added up.
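
The three indexes listed above can be sketched as follows. These are simplified textbook-style versions, not the code behind the scores on this page: the MTLD runs in one direction only, the 0.72 threshold and the sample size of 42 are commonly cited defaults that we assume here, Yule's I is taken as the reciprocal form of Yule's K without rescaling, and none of the raw values are normalized to the 1 to 100 scale used in the table.

```python
import math
from collections import Counter


def mtld_forward(tokens, threshold=0.72):
    """One-directional MTLD: words per 'factor' (a stretch whose TTR fell below threshold)."""
    factors, seen, count = 0.0, set(), 0
    for token in tokens:
        count += 1
        seen.add(token)
        if len(seen) / count < threshold:
            factors += 1      # a full factor is complete
            seen.clear()      # reset the running TTR
            count = 0
    if count:
        # Credit the unfinished stretch as a partial factor.
        factors += (1 - len(seen) / count) / (1 - threshold)
    return len(tokens) / factors if factors else float("inf")


def yules_k_and_i(tokens):
    """Yule's K = 10^4 * (M2 - N) / N^2 and its reciprocal form I = N^2 / (M2 - N)."""
    n = len(tokens)
    m2 = sum(freq * freq for freq in Counter(tokens).values())
    if m2 == n:               # every word occurs exactly once
        return 0.0, float("inf")
    return 1e4 * (m2 - n) / (n * n), (n * n) / (m2 - n)


def hdd(tokens, sample_size=42):
    """HD-D: summed chance of each word appearing in a random sample, divided by sample size."""
    n = len(tokens)
    if n < sample_size:
        raise ValueError("text is shorter than the sample size")
    total = 0.0
    for freq in Counter(tokens).values():
        # Hypergeometric probability that the word is absent from the sample.
        p_absent = math.comb(n - freq, sample_size) / math.comb(n, sample_size)
        total += (1 - p_absent) / sample_size
    return total


toy = ["och", "de", "och", "och"]
print(mtld_forward(toy), yules_k_and_i(toy), hdd(toy, sample_size=2))
```

The toy list is far too short for meaningful values; it only serves to make the arithmetic easy to follow by hand.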

Our overall measure of grammatical difficulty is based on a combination of the compound lexical diversity index (which includes the MTLD, Yule's I and HD-D indexes), the ARI and the CLI, all normalized and each assigned a weight. The score should normally range from 1 to 100. In this case, the score is 50.
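
As a quick check, the compound lexical diversity index of 69 is consistent with a plain average of the three normalized scores in the table. The sketch assumes an unweighted mean rounded to the nearest integer; the weights used for the overall grammatical score are not given on this page, so that step is not reproduced here.

```python
from statistics import mean

# Normalized scores taken from the table above.
scores = {"MTLD": 64, "HD-D": 67, "Yule's I": 77}

compound = mean(scores.values())
print(round(compound))
```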

Other Information about Bland klippbergens indianer by George Alfred Henty

We provide a sample of the text below; the full text of Bland klippbergens indianer is also available free of charge on our website.

Sample of text:

» Några skålpund mjöl voro fullständigt genomdränkta af vatten, men själfva hufvudförrådet var oskadadt, ty påsen var insydd i björnskinn. Denna öppnades endast, då man behöfde ett förråd för flere dagar, och tillslöts alltid mycket omsorgsfullt. Så snart de landstigit, hade Jagande hunden gått ut på spaning efter drifved, och snart brann en klar eld. Ett stycke björnfett lades i pannan, och af det våta mjölet bakades genast några tunna kakor, hvilka voro sega och smaklösa; men förrådet var för dyrbart, att man skulle låta ett enda uns förfaras. »Hvad tänka min hvite broder göra?» frågade höfdingen Harry. »Så vidt jag kan se, hafva vi intet annat att göra än att fortsätta. Vi voro ju alla af den meningen, att det är nästan omöjligt att bana sig väg öfver de där kala bergen. Ännu mindre kunna ...

Top most frequently used words in Bland klippbergens indianer by George Alfred Henty*

Position	Word	Repetitions	Part of all words
1	och	2,928	2.93%
2	att	2,715	2.72%
3	de	1,654	1.66%
4	en	1,593	1.60%
5	det	1,362	1.36%
6	på	1,323	1.33%
7	icke	1,173	1.18%
8	vi	1,061	1.06%
9	af	1,025	1.03%
10	som	981	0.98%
11	han	908	0.91%
12	till	898	0.90%
13	för	892	0.89%
14	jag	878	0.88%
15	den	843	0.84%
16	med	841	0.84%
17	är	829	0.83%
18	om	806	0.81%
19	ett	764	0.77%
20	sig	723	0.72%
21	sade	623	0.62%
22	men	623	0.62%
23	skulle	559	0.56%
24	då	525	0.53%
25	hade	522	0.52%
26	så	513	0.51%
27	var	505	0.51%
28	dem	485	0.49%
29	hafva	466	0.47%
30	Tom	430	0.43%
31	oss	415	0.42%
32	du	367	0.37%
33	där	366	0.37%
34	kunna	360	0.36%
35	kunde	341	0.34%
36	Harry	340	0.34%
37	upp	324	0.32%
38	öfver	323	0.32%
39	kan	312	0.31%
40	har	308	0.31%
41	ni	292	0.29%
42	vara	292	0.29%
43	här	287	0.29%
44	ned	279	0.28%
45	Jerry	269	0.27%
46	komma	264	0.26%
47	mycket	261	0.26%
48	nu	259	0.26%
49	man	258	0.26%
50	mig	257	0.26%
51	några	257	0.26%
52	vid	246	0.25%
53	från	246	0.25%
54	sin	242	0.24%
55	än	241	0.24%
56	ty	240	0.24%
57	andra	236	0.24%
58	voro	235	0.24%
59	honom	232	0.23%
60	två	229	0.23%
61	göra	224	0.22%
62	efter	222	0.22%
63	få	222	0.22%
64	eller	216	0.22%
65	höfdingen	214	0.21%
66	ut	210	0.21%
67	alla	210	0.21%
68	måste	202	0.20%
69	äro	198	0.20%
70	snart	189	0.19%
71	se	185	0.19%
72	sedan	184	0.18%
73	gå	182	0.18%
74	något	181	0.18%
75	sina	175	0.18%
76	utan	166	0.17%
77	hans	165	0.17%
78	fram	163	0.16%
79	någon	162	0.16%
80	varit	161	0.16%
81	dig	160	0.16%
82	endast	157	0.16%
83	hästarna	153	0.15%
84	tror	152	0.15%
85	in	141	0.14%
86	hästen	140	0.14%
87	dalen	140	0.14%
88	deras	134	0.13%
89	Hoppande	134	0.13%
90	höfding	132	0.13%
91	indianerna	131	0.13%
92	åt	130	0.13%
93	innan	128	0.13%
94	hålla	128	0.13%
95	tre	127	0.13%
96	min	125	0.13%
97	skola	125	0.13%
98	Jagande	124	0.12%
99	er	121	0.12%
100	skall	120	0.12%
101	mera	120	0.12%
102	hunden	117	0.12%
103	nog	115	0.12%
104	ännu	115	0.12%
105	litet	112	0.11%
106	denna	111	0.11%
107	blifvit	110	0.11%
108	plats	108	0.11%
109	hvad	106	0.11%
110	väg	105	0.11%
111	genom	105	0.11%
112	ingen	104	0.10%
113	par	101	0.10%
114	många	100	0.10%
115	ju	100	0.10%
116	stor	99	0.10%
117	medan	97	0.10%
118	veta	96	0.10%
119	långt	95	0.10%
120	dessa	95	0.10%
121	under	95	0.10%
122	bort	94	0.09%
123	första	93	0.09%
124	farbror	93	0.09%
125	säkert	93	0.09%
126	blir	93	0.09%
127	Sam	91	0.09%
128	hela	91	0.09%
129	vill	91	0.09%
130	snön	91	0.09%
131	bättre	90	0.09%
132	våra	89	0.09%
133	detta	89	0.09%
134	fyra	89	0.09%
135	taga	88	0.09%
136	tillbaka	88	0.09%
137	tid	87	0.09%
138	sidan	87	0.09%
139	bra	86	0.09%
140	vår	86	0.09%
141	förrän	86	0.09%
142	kommo	85	0.09%
143	kommer	84	0.08%
144	äfven	84	0.08%
145	tills	84	0.08%
146	genast	83	0.08%
147	inte	82	0.08%
148	lika	82	0.08%
149	längre	82	0.08%
150	gick	82	0.08%
151	sitt	82	0.08%
152	gjorde	81	0.08%
153	Pete	81	0.08%
154	gått	80	0.08%
155	Ja	80	0.08%
156	fått	80	0.08%
157	omkring	79	0.08%
158	indianer	79	0.08%
159	väl	79	0.08%
160	mil	79	0.08%
161	dag	79	0.08%
162	aldrig	78	0.08%
163	huru	78	0.08%
164	kom	77	0.08%
165	elden	76	0.08%
166	ur	75	0.08%
167	alldeles	75	0.08%
168	hit	75	0.08%
169	också	75	0.08%
170	kommit	75	0.08%
171	nästan	74	0.07%
172	säga	74	0.07%
173	behöfva	73	0.07%
174	spår	73	0.07%
175	klippan	73	0.07%
176	gjort	73	0.07%
177	länge	73	0.07%
178	hästar	73	0.07%
179	finna	72	0.07%
180	började	72	0.07%
181	ungefär	71	0.07%
182	vore	71	0.07%
183	floden	71	0.07%
184	stanna	71	0.07%

* This list excludes punctuation and single-letter words, as well as some different-case repeats of the same words.
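
A frequency table of this kind can be sketched with a simple counter. The function name `top_words` is ours, and the details are assumptions: words are taken to be runs of letters, single-letter words are dropped, case is left untouched, and percentages are computed against the words kept, which may differ slightly from the "part of all words" column above.

```python
import re
from collections import Counter


def top_words(text, limit=10):
    """Rows of (position, word, repetitions, share) for the most frequent words."""
    # Runs of letters only; punctuation and digits are skipped, single letters dropped.
    words = [w for w in re.findall(r"[^\W\d_]+", text) if len(w) > 1]
    counts = Counter(words)
    total = len(words)
    return [
        (position, word, repetitions, f"{repetitions / total:.2%}")
        for position, (word, repetitions) in enumerate(counts.most_common(limit), start=1)
    ]


rows = top_words("och de och vi i och de")
for row in rows:
    print(*row, sep="\t")
```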

If you think the text would be accessible to you, you can read it on our site.

Other resources and languages

If you like this analysis, you should have a look at our lists of Swedish short stories and Swedish books.

If you like literature as a means to learn languages, please take a look at our project Interlinear Books. We even have a Swedish Interlinear book available for purchase.