Sie sind auf Seite 1von 5

(/dashboard)

23 63:22

Mark Up Word Groups And Link Wikipedia Articles -German


Wiki cation Annotation guidelines


Your task is to:

Find all word groups in English sentences that have a Wikipedia page and express a name, noun or topic.
Link each word group to the corresponding English Wikipedia pages
Align each word group to the corresponding translations in German sentences that express a name, noun or topic.
Link the aligned German word groups to the corresponding German Wikipedia pages

General Instructions for Task Success!


1. Identify, but DO NOT mark up, the rst word or word group in the English sentence that might have a Wikipedia page
and express a name, noun or topic.
Most general names and topics nowadays have their own wikipage.
We provide the previous context sentences and the course title to better understand the exact meaning of the
English sentence that you need to mark up.
Skip numbers and dates (even though for example 1990s does have its own wiki page).
Try to identify the longest possible matching word group that has a matching Wikipedia page title or a wiki-
redirection page. For example, in the sentenceHe became superintendent of the New York State Police in January
.the longest word group that has a wikipedia page is New York State Police.
2. Click on the link English Wikipedia Homepage and nd the Wikipedia page or redirection page with the same title and
the same meaning as this word or word group. (Wikipedia uses redirection links to link synonyms to the same page.)
When no match is found, try a smaller word group if possible.
Sometimes the wikipage title and the English text term do not exactly match but they do have the same meaning; in
such case we use that part of the text that does overlap with the wikipage title (for example the word Einstein links
to the wikipage Albert Einstein (https://en.wikipedia.org/wiki/Albert_Einstein))
When again no match is found, identify the next possible candidate and start again.
3. Mark up the word group that has a Wikipedia page and Click the [Add] button in the pop-up.
ry to avoid marking up verbs in English as relevant word groups (e.g. do not mark up "is married" because it doesnt
express a name, noun or topic.)
We do notmark up punctuation marks, adjectives or determiners (the, a/an, my/your/his/her/its/our/their etc)
unlessthey are part of the Wikipedia page title.
In a larger text fragment, we mark up all repeated word groups that reference to the same wiki page, but we do not
mark up pronouns like 'he', him, they
In case the English text term is in plural form, mark up the plural form, although it links to a wikipage title in singular
form (for example if you nd the word MOOCs, mark up MOOCs, although it links to the wikipage title MOOC)
As in point [1] we mark up the longest possible matching word group that has a matching Wikipedia page title or a
wiki-redirection page. For example, in the sentence He became superintendent of the New York State Police in
January .we mark up New York State Police as one unit. We do not mark up the smaller overlapping units New York
and State Police as the top level is already marked with an appropriate wiki link.
4. Copy the URL of the Wikipedia page and Paste it into the textbox that appears.
5. Click [Add URL] when you are nished (you cannot edit or change your selection after this step!).
6. Identify the corresponding word (group) in the German sentence (if indeed present).
7. Click on the link German Wikipedia Homepage and search whether the word group has an existing Wikipedia page or
redirection page with the same title and the same meaning.
Sometimes the Wikipedia page title and the text term do not exactly match; in such cases we use that part of the text
that does overlap with the Wikipedia page title. Apartial overlap like the Einsteinin the sentence that needs to be
linked to theWikipedia page Albert Einstein (https://de.wikipedia.org/wiki/Albert_Einstein)
Redirection: when searching for term A, Wikipedia uses re-direction to match against a synonym. Such redirections
are acceptable since they are imposed by Wikipedia.
Even if no match is found, continue to the next step.
8. Mark up the corresponding word (group) in the German sentence (we do not mark up punctuation marks, adjectives or
determiners e.g. der/die/das/ein/kein/mein/alle etc. unless they are part of the Wikipediapage title) and Click the [Add]
button in the pop-up.
If there is NOT a corresponding Wikipedia Page, type in NONE in the [Add URL] text box.
Else: paste the URL of the German Wikipedia page in the designated text box.
9. Click [Add URL] after checking that everything looks correct.
10. Continue with the next word group in the sentence and repeat steps 1-9 until all possible potential word groups were
inspected.

Speci c instructions with an Example


In the following Example 1 we show how this works.

(EXAMPLE 1) course title: History of music

English sentence: The beautiful song Biggest Mistake is written by The Rolling Stones .

German Sentence: Der wunderschne Song Biggest Mistake wurde von den Rolling Stones verfasst .

1. The rst word (group) that expresses a clear concept and might therefore have its own Wikipedia page is the word song.
2. We search for the word song in the English Wikipedia search bar and we nd a match with the Wikipedia page
https://en.wikipedia.org/wiki/Song (https://en.wikipedia.org/wiki/Song). We verify that the Wikipedia page refers to the
concept with the same meaning (musical composition) and this is indeed true.
3. You return to your task and mark up the word song and click [Add] button on the pop-up box.
4. Copy the Wikipedia URL that you have found andpaste it in the textbox.
5. Click on [Add URL].
6. Next, look for the corresponding word (Song) in the German sentence and mark it up and click [Add]. You can change the
markup before clicking on [Add] button.
The German language can be an automatic translation and can contain errors: we do not correct these errors!
7. Click on the link German Wikipedia Homepage and search whether the corresponding word has an existing Wikipedia
page or redirection page with the same title and the same meaning.
In this case, a Wikipedia page exists: https://de.wikipedia.org/wiki/Song (https://de.wikipedia.org/wiki/Song)
Note that the translator chose to use Song in German and not the near synonym Lied that has another wikipage.
We verify that the German Song wikipage does cover the same meaning as the way the word is used in this sentence
(and as the English wikipage albeit a more speci c meaning of popular music).
8. Copy the URL of the Song Wikipedia page and Paste it to the textbox.
9. Click [Add URL] when youre ready.
10. Continue with the next English word group Biggest Mistake where we nd that a corresponding Wikipedia page does
exist : https://en.wikipedia.org/wiki/Biggest_Mistake (https://en.wikipedia.org/wiki/Biggest_Mistake)
11. You return to your task, mark up the words Biggest Mistake and click [Add].
12. Next mark up the German equivalent word group Biggest Mistake, click[Add] and check whether it has a corresponding
German Wikipedia page. This is not the case and we type NONE in the URL text box and [Add URL].
13. The last potential word group in the English sentence is The Rolling Stones and we nd the matching Wikipedia page.
Note that here the word the is part of the name and also part of the Wikipedia page title and thus included in the mark
up and add: https://en.wikipedia.org/wiki/The_RollingStones (https://en.wikipedia.org/wiki/The_Rolling_Stones)
14. When we look for the German corresponding words, the word den is not part of the actual title of the Wikipedia page so
we only mark up Rolling Stones and link this to the page<a href="https://de.wikipedia.org/wiki/TheRollingStones"
target="blank"> https://de.wikipedia.org/wiki/The_Rolling_Stones
15. We show the nal annotated result in brackets below:
(EXAMPLE 1)

The beautiful [song] [Biggest Mistake] is written by [The Rolling Stones].

Der wunderschne [Song] [Biggest Mistake] wurde von den [Rolling Stones] verfasst.

1st set

[song] https://en.wikipedia.org/wiki/Song (https://en.wikipedia.org/wiki/Song)


[Song] https://de.wikipedia.org/wiki/Song (https://de.wikipedia.org/wiki/Song)

2nd set

[Biggest Mistake] https://en.wikipedia.org/wiki/Biggest_Mistake (https://en.wikipedia.org/wiki/Biggest_Mistake)

[Biggest Mistake]

3rd set

[The Rolling Stones] https://en.wikipedia.org/wiki/The_Rolling_Stones (https://en.wikipedia.org/wiki/The_Rolling_Stones)

[Rolling Stones] https://de.wikipedia.org/wiki/The_Rolling_Stones (https://de.wikipedia.org/wiki/The_Rolling_Stones)

More Examples
(EXAMPLE 2)

[Agrippa's Trilemma] states that there are three options if we try to prove the [truth] .

Another example to show how we annotate a sentence. Both [Agrippa] and [Trilemma] have their own separate Wikipedia
page, but we annotate the longest word group [Agrippa's Trilemma] that points, via redirection, to the Wikipedia page
entitled 'Mnchhausen trilemma'. Wikipedia uses redirection links to link synonyms to the same page.

[Agrippa's Trilemma] https://en.wikipedia.org/wiki/M%C3%BCnchhausen_trilemma


(https://en.wikipedia.org/wiki/M%C3%BCnchhausen_trilemma)

[truth] https://en.wikipedia.org/wiki/Truth (https://en.wikipedia.org/wiki/Truth)

(EXAMPLE 3)

The Fifth Element is a French science ction movie .

In (3) the determiner 'The' and adjective 'Fifth' are part of the matching Wikipedia page title and we mark them up.

The next longest possible word group that might have a page is French science ction movie. When searching for the full
phrase, we do not nd any hit.

Now we look for the smaller word group and here we have two options for splitting:

1. [French science ction] [movie]

2. [French] [science ction movie]

as we aim to capture the most prominent concept (a certain type of movie) we choose option 2 and search for [science ction
movie]. We nd the matching page via redirection. Note the we do not annotate adjectives (see point 3 of the general
instructions above) and we do not mark up the word French.

[The Fifth Element] https://en.wikipedia.org/wiki/The_Fifth_Element (https://en.wikipedia.org/wiki/The_Fifth_Element)

[science ction movie] https://en.wikipedia.org/wiki/Science_ ction_ lm (https://en.wikipedia.org/wiki/Science_ ction_ lm)

Mark Up Word Groups And Link Wikipedia Articles


Course : social innovation
English Context
English Context
In March 2011 there was an earthquake in Japan that triggered a tsunami leading to the Fukushima nuclear disaster.
Japan is one of the most dependent countries in the world on nuclear power.

German Context
Im Mrz 2011 gab es ein Erdbeben in Japan , das einen Tsunami auslste , der zur Nuklearkatastrophe von Fukushima
fhrte . Japan ist eins der von Kernernergie abhngigsten Lnder der Welt.

Mark up the English word/word group


However , the society in Japan adapted itself and learned to save energy .

Align the German word/word group (even if there is no Wikipedia page)


Doch die Gesellschaft in Japan hat sich angepasst und gelernt , Energie zu sparen .

Mark up and Add the word/word Paste Wikipedia Add Wikipedia


Search the word/word group in Wikipedia.
group. URL URL

English Wikipedia Homepage


(https://en.wikipedia.org/wiki/)

Add word/word group, when you're sure.

Add

Mark up the corresponding word/word group, even if there is no Wikipedia page and then click "Add"

Add
No Match

Mark Up Word Groups And Link Wikipedia Articles


Course : History of music
English Context
-

German Context
-

Mark up the English word/word group


The beautiful song Biggest Mistake is written by The Rolling Stones .

Align the German word/word group (even if there is no Wikipedia page)


Der wunderschne Song Biggest Mistake wurde von den Rolling Stones verfasst .

Mark up and Add the word/word Paste Wikipedia Add Wikipedia


Search the word/word group in Wikipedia.
group. URL URL

English Wikipedia Homepage


(https://en.wikipedia.org/wiki/)

Add word/word group, when you're sure.

Add
Mark up the corresponding word/word group, even if there is no Wikipedia page and then click "Add"

Add
No Match

Mark Up Word Groups And Link Wikipedia Articles


Course : Modeling and Simulation
English Context
We have a dataset with some outliers.

German Context
Wir haben ein Dataset mit ein paar Ausreiern.

Mark up the English word/word group


Calculate the mean and the median .

Align the German word/word group (even if there is no Wikipedia page)


Berechnen Sie den Mittelwert und Median .

Mark up and Add the word/word Paste Wikipedia Add Wikipedia


Search the word/word group in Wikipedia.
group. URL URL

English Wikipedia Homepage


(https://en.wikipedia.org/wiki/)

Add word/word group, when you're sure.

Add

Mark up the corresponding word/word group, even if there is no Wikipedia page and then click "Add"

Add
No Match