Tre Konstnärinnor - Fanny Churberg, Maria Wiik och Sigrid af Forselles by Helena Westermarck : Difficulty Assessment for Swedish Learners

How difficult is Tre Konstnärinnor - Fanny Churberg, Maria Wiik och Sigrid af Forselles for Swedish learners? We have performed multiple tests on its full text (freely available here) of approximately 38,518, crunched all the numbers for you and present the results below.

Read the Full Text Now for Free!

Difficulty Assessment Summary

We have estimated Tre Konstnärinnor - Fanny Churberg, Maria Wiik och Sigrid af Forselles to have a difficulty score of 64. Here're its scores:

Measure Score
easy difficult (1 - 100)
Overall Difficulty 64% 64
Vocabulary Difficulty 71% 71
Grammatical Difficulty 58% 58

Vocabulary Difficulty: Breakdown

71%

Vocabulary difficulty: 71%

This score has been calculated based on frequency vocabulary (the top most frequently used words in Swedish). It combines various measures of Tre Konstnärinnor - Fanny Churberg, Maria Wiik och Sigrid af Forselles's text analyzed in terms of frequency vocabulary: a plain vocabulary score, frequency-weighted vocabulary score, banded frequency vocabulary scores based on vocabulary of the text falling in the top 1,000 or 2,000 most frequent words, etc. Here's a further breakdown of how often the top most frequently used words in Swedish appear in the full text of Tre Konstnärinnor - Fanny Churberg, Maria Wiik och Sigrid af Forselles:

Vocabulary difficulty breakdown for Tre Konstnärinnor  - Fanny Churberg, Maria Wiik och Sigrid af Forselles: a test for Swedish top frequency vocabulary

We have also calculated the following approximate data on the vocabulary in Tre Konstnärinnor - Fanny Churberg, Maria Wiik och Sigrid af Forselles:

Measure Score
Measure Score
Number of words 38,518
Number of unique words 8,307
Number of recognized words for names/places/other entities 1,610
Number of very rare non-entity words 1,767
Number of sentences 5,077
Average number of words/sentence 8

There is some research suggesting that that you need to know about 98% of a text's vocabulary in order to be able to infer the meaning of unknown words when reading. If true, this means that you would need to know around 8,140 words (where all the forms of the word are still counted as unique words) in Swedish to be able to read Tre Konstnärinnor - Fanny Churberg, Maria Wiik och Sigrid af Forselles without a dictionary and fully understand it.

Grammatical Difficulty: Breakdown

58%

Grammatical difficulty: 58%

Here is the further grammatical comparison on this text. You can find an explanation of all these scores below.

Measure Score
Measure Score
Automated Readability Index 6
Coleman-Liau Index 10
Type/Token Ratio (TTR) 0.215665
Root type/Token Ratio (RTTR) 0.00000559908
Corrected type/Token Ratio (CTTR) 0.00000279954
MTLD Index 71
HDD Index 67
Yule's I Index 75
Lexical Diversity Index (MTLD + HD-D + Yule's I) 71

The type-token ratio (TTR) of Tre Konstnärinnor - Fanny Churberg, Maria Wiik och Sigrid af Forselles is 0.215665. The TTR is the most basic measure of lexical diversity. To calculate it, we divide the number of unique words by the number of words in the text. For example, for this text, the number of unique words is 8,307, while the number of words is 38,518, so the TTR is 8,307 / 38,518 = 0.215665. However, the TTR is a very crude measure, as it is extremely dependent on text length. The longer the text, the lower the TTR is usually going to be, since common words tend to often repeat. Especially since the number of words in this text is more than 1,000, the TTR is not likely to give an accurate measure.

The root type-token ratio (RTTR) and corrected type-token ratio (CTTR) are measures which were suggested by researchers to partially address the problem of TTR's variance on text length. In the RTTR, the number of unique words is divided by a square of the number of words (therefore, 8,307 / (38,518 * 38,518) = 0.00000559908), while in CTTR, it is divided by a square of the number of words, multiplied twice 8,307 / 2 * (38,518 * 38,518) = 0.00000279954). However, these measures are not as easily readable, and also there is a growing body of research asserting that CTTR and RTTR do not effectively address the problems of text length. Therefore, while we do provide the full text's TTR, RTTR and CTTR on this page, these fiqures do not form part of our final calculations.

The Automated Readability Index (ARI) is one readability measure that has been developed by researchers over the years. The formula for calculating the ARI is as follows:
Formula for calculating the Automated Readability Index

The ARI should compute a reading level approximately corresponding to the reader's grade level (assuming the reader undertakes formal education). Thus, for example, a value of 1 is kindergarten level, while a value of 12 or 13 is the last year of school, and 14 is a sophomore at college. The current ARI of this text is 6, making it understandable for 6-grade students at their expected level of education.

The Coleman Liau Index (CLI) is a similar index designed by Meri Coleman and T. L. Liau, and it is supposed to compute the grade level of the reader (thus, for example, sophomore level material would be around grade 14, or year 14 of formal education, while kindergarten / primary school level material would be close to grade 1 in the CLI). The CLI is usually slightly higher than the ARI. The CLI is computed with this formula:
Formula for calculating the Coleman-Liau Readability Index

It is notable that other indexes exist, such as the Flesch-Kincaid Reading Ease, Gunning-Fog Score, and others, but we have chosen not to include them, since, contrary to the ARI and CLI, such other indexes are based on a syllable count and therefore arguably only work for English and not Swedish.

We compute a further compound lexical diversity index, which should range from 1 to a 100 (with the standard deviation being around 10, and its average value being around 50) - it is 71 in the present case. The compound lexical diversity index consists of the following indexes, averaged out (and also provided in the table above):

  • the Measure of Textual Lexical Diversity (MTLD) index - a measure which is based on computing the TTR for increasingly larger parts of the text until the TTR drops below a certain threshold point (around 0.7 in our case) - in which case, the TTR is reset, and the overall counter is increased; the counter is at the end divided by the number of words in text; as a result, the MTLD does not significantly vary by text length;
  • the Yule's I index (based on Yule's K characteristic inverted) - an index based on the work of the statistician G.U. Yule, who published his index of Frequency Vocabulary in his paper "The statistical study of literary vocabulary"; Yule's I takes into account the number of words in the text, and a compound summed measure of word frequency;
  • the Hypergeometric Distribution D (HD-D) index (based on vocd) - an index which assesses the contribution of each word to the diversity of the text; to calculate such contributions, a hypergeometric distribution is used to compute probabilities of each word appearing in word samples extracted from the text; then such distributions are divided by sample sizes and added up;

Our overall measure of grammatical diversity is based on a combination of the compound lexical diversity index (which includes the MTLD, Yule's I and HD-D indexes), the ARI and CLI, all normalized and given certain weight. The score should normally range from 1 to 100. In this case, the score is 58.

Other Information about Tre Konstnärinnor - Fanny Churberg, Maria Wiik och Sigrid af Forselles by Helena Westermarck

We provide you a sample of the text below, however, the full text of the Tre Konstnärinnor - Fanny Churberg, Maria Wiik och Sigrid af Forselles is also available free of charge on our website.

Sample of text:

Fröknarna Wiik kunde tryggt stanna kvar ombord; färden gick till Hamburg i alla fall och på den stora båten funnos även några hyttplatser för passagerare. Det blev en mycket angenäm och rolig färd, som båda systrarna senare ofta med förtjusning beskrivit för sina vänner. Hytterna voro rymliga och bekväma, maten var utmärkt och de båda damerna, som voro de enda kvinnliga passagerarna ombord, bemöttes på det mest uppmärksamma och höviska sätt. En högst ovanlig förströelse erbjöds dem även därigenom att en del av den stora båtens last bestod av — en mängd levande djur från tropikerna, avsedda för den stora zoologiska trädgården i Hamburg. Här funnos många olika slag av fyrfota djur, bland vilka det remarkablaste torde varit »världens minsta elefant»; papegojor och andra exotiska fåglar lyste i grannaste färgprakt och en mängd apor bidrogo genom sin rörlighet och sina löjliga upptåg till att ...

Top most frequently used words in Tre Konstnärinnor - Fanny Churberg, Maria Wiik och Sigrid af Forselles by Helena Westermarck*

Position Word Repetitions Part of all words
Position Word Repetitions Part of all words
1 och 1,455 3.78%
2 att 675 1.75%
3 av 662 1.72%
4 en 555 1.44%
5 som 543 1.41%
6 den 477 1.24%
7 hon 439 1.14%
8 det 428 1.11%
9 för 425 1.1%
10 de 404 1.05%
11 till 404 1.05%
12 med 386 1%
13 375 0.97%
14 jag 310 0.8%
15 sig 254 0.66%
16 var 241 0.63%
17 ett 234 0.61%
18 är 213 0.55%
19 där 182 0.47%
20 174 0.45%
21 om 165 0.43%
22 sin 161 0.42%
23 har 161 0.42%
24 han 157 0.41%
25 hade 156 0.41%
26 hennes 153 0.4%
27 under 141 0.37%
28 137 0.36%
29 icke 133 0.35%
30 mycket 127 0.33%
31 men 121 0.31%
32 mig 110 0.29%
33 dem 108 0.28%
34 här 105 0.27%
35 sina 103 0.27%
36 man 102 0.26%
37 även 101 0.26%
38 eller 101 0.26%
39 Paris 100 0.26%
40 vi 99 0.26%
41 denna 98 0.25%
42 vid 97 0.25%
43 ej 96 0.25%
44 efter 95 0.25%
45 från 93 0.24%
46 skulle 86 0.22%
47 tid 83 0.22%
48 sitt 80 0.21%
49 stora 79 0.21%
50 Fanny 78 0.2%
51 nu 78 0.2%
52 alla 76 0.2%
53 ut 75 0.19%
54 genom 73 0.19%
55 arbete 71 0.18%
56 henne 71 0.18%
57 kunde 71 0.18%
58 sedan 70 0.18%
59 Maria 69 0.18%
60 blev 68 0.18%
61 oss 68 0.18%
62 Wiik 66 0.17%
63 hos 65 0.17%
64 vara 65 0.17%
65 också 64 0.17%
66 andra 62 0.16%
67 år 61 0.16%
68 huru 60 0.16%
69 hem 60 0.16%
70 såsom 59 0.15%
71 några 57 0.15%
72 kan 57 0.15%
73 redan 56 0.15%
74 än 55 0.14%
75 över 54 0.14%
76 54 0.14%
77 någon 53 0.14%
78 dessa 52 0.14%
79 dock 52 0.14%
80 konst 51 0.13%
81 varit 50 0.13%
82 detta 50 0.13%
83 talet 50 0.13%
84 nya 49 0.13%
85 se 49 0.13%
86 upp 48 0.12%
87 Helsingfors 48 0.12%
88 stor 48 0.12%
89 första 47 0.12%
90 voro 47 0.12%
91 äro 46 0.12%
92 ha 46 0.12%
93 följande 46 0.12%
94 skall 46 0.12%
95 ännu 45 0.12%
96 brev 44 0.11%
97 gång 44 0.11%
98 mot 44 0.11%
99 Churberg 44 0.11%
100 utan 43 0.11%
101 du 43 0.11%
102 unga 43 0.11%
103 vad 43 0.11%
104 af 42 0.11%
105 många 42 0.11%
106 hela 42 0.11%
107 endast 42 0.11%
108 väl 41 0.11%
109 hans 40 0.1%
110 Forselles 40 0.1%
111 både 40 0.1%
112 själv 40 0.1%
113 allt 40 0.1%
114 något 39 0.1%
115 del 39 0.1%
116 ju 38 0.1%
117 dess 38 0.1%
118 ateljé 38 0.1%
119 Sigrid 38 0.1%
120 blivit 38 0.1%
121 deras 37 0.1%
122 dag 37 0.1%
123 ville 37 0.1%
124 åt 37 0.1%
125 flere 36 0.09%
126 tavlor 36 0.09%
127 konstnärer 36 0.09%
128 ur 36 0.09%
129 ty 36 0.09%
130 vilka 36 0.09%
131 göra 36 0.09%
132 fått 35 0.09%
133 vår 35 0.09%
134 mer 35 0.09%
135 hava 35 0.09%
136 vilken 34 0.09%
137 studier 34 0.09%
138 helt 34 0.09%
139 åter 33 0.09%
140 fick 33 0.09%
141 naturen 33 0.09%
142 33 0.09%
143 ofta 32 0.08%
144 lilla 32 0.08%
145 min 32 0.08%
146 alltid 31 0.08%
147 liv 31 0.08%
148 början 31 0.08%
149 måste 31 0.08%
150 fram 31 0.08%
151 kom 30 0.08%
152 Diisseldorf 30 0.08%
153 naturligtvis 30 0.08%
154 in 30 0.08%
155 målade 30 0.08%
156 samt 29 0.08%
157 hemma 29 0.08%
158 vistades 28 0.07%
159 bland 28 0.07%
160 senare 28 0.07%
161 komma 28 0.07%
162 får 28 0.07%
163 Hanna 28 0.07%
164 liten 27 0.07%
165 porträtt 27 0.07%
166 gjort 26 0.07%
167 mitt 26 0.07%
168 mest 26 0.07%
169 kommer 25 0.06%
170 två 25 0.06%
171 honom 25 0.06%
172 olika 25 0.06%
173 stod 25 0.06%
174 kl 25 0.06%
175 samma 25 0.06%
176 konstnärinnan 25 0.06%
177 sätt 25 0.06%
178 sålunda 24 0.06%
179 mina 24 0.06%
180 hösten 24 0.06%
181 inte 24 0.06%
182 vackra 24 0.06%

This list excludes punctuation or single-letter words, also some different-case repeats of the same words.

If you think the text would be accessible to you, you can read it on our site (click on the cover to access):

Cover of Tre Konstnärinnor  - Fanny Churberg, Maria Wiik och Sigrid af Forselles by Helena Westermarck

Other resources and languages

If you like this analysis, you should have a look at out our lists of Swedish short stories and Swedish books.

If you like literature as a means to learn languages - please take a look at our project Interlinear Books. We even have a Swedish Interlinear book available for purchase.