Beruflich Dokumente
Kultur Dokumente
Lexical knowledge
Early works
E-HowNet
Lexical knowledge representation
hyperonym
{vehicle}
{hinge; flexible joint}
hyperonym {bumper}
{motor vehicle; automotive vehicle} meronym
{car door} {doorlock}
meronym meronym
hyperonym
{car window}
{car; auto; automobile; machine; motorcar} {armrest}
meronym
{car mirror}
hyperonym hyperonym
{cruiser; squad car; patrol car; police car; prowl car} {cab; taxi; hack; taxicab; }
Table 1: WordNet1.5 Relations
Relation PoS linked Example EWN
ANTONYMY noun/noun; verb/verb; man/woman; enter/exit; yes
adjective/adjective beautiful/ugly
HYPONYMY noun/noun slicer/knife yes
MERONYMY noun/noun head/nose yes
ENTAILMENT verb/verb buy/pay SUBEVENT or
CAUSE
TROPONYM verb/verb walk/move HYPONYMY
CAUSE verb/verb kill/die yes
ALSO SEE verb/adjective no
DERIVED FROM adjective/adverb beautiful/beautifully yes
ANTONYM noun/noun; verb/verb heavy/light yes
ATTRIBUTE noun/adjective size/small XPOS_HYPONYM
RELATIONAL adjective/noun atomic/ atomic bomb PERTAINS TO
ADJ
SIMILAR TO adjective/adjective ponderous/heavy no
PARTICIPLE adjective/verb elapsed/ elapse no
Disadvantages
No information regarding conceptual
differences between different synsets
Do not provide mechanisms for
semantic composition
No information for unknown words
HowNet :
HowNet describes the following conceptual
relations﹕
(1) Hypernymy 上下位關係
(2) Synonymy 同義關係
(3) Antonymy 反義關係
(4) Attribute-host 屬性-宿主關係
(5) Part-whole 部件-整體關係
(6) Event-role事件-角色關係
HowNet uses about 2000 primitives to define
concepts.
Examples :
僱主={human|人:{employ|僱用:agent={~}}
僱員={human|人:{employ|僱用:patient={~}}
熨斗={tool|用具:{AlterForm|變形狀:
result={level|平}}
假期={time|時間:{rest|休息},{WhileAway|
消閑}}
旅館={Institute Place|場所:{reside|住下,
tour|旅遊:location={~}}
救生艇={ship|船:{rescue|救
助:instrument={~}}
HowNet ontology :
V event|事件
V1 static|靜態 V2 act|行動
V1.0 relation|關係 V2.0 AlterRelation|變關係
V1.01 isa|是非關係 V2.01 AlterIsa| 變是非
…
V1.1 state|狀態 V2.1 AlterState|變狀態
Common sense knowledge
Disadvantages
Representation by primitives degrades precision and
readability.
老虎tiger DEF={beast|走獸}
鉗子forceps DEF={tool|用具:{hold|拿:instrument={~}}}
鐘錶店watchmaker's shop
DEF={InstitutePlace|場所:{buy|買:
location={~},possession={tool|用具:{tell|告訴:
content={time|時間},instrument={~}}}},{repair|修理:
location={~},patient={tool|用具:{tell|告訴:content={time|
時間},instrument={~}}}},{sell|賣:
location={~},possession={tool|用具:{tell|告訴:
content={time|時間},instrument={~}}}}}
Without considering semantic composition and
decomposition.
E.g. function words: 僅just
DEF={FuncWord|功能詞:emphasis={?}}
E-HowNet
Outline
What is E-HowNet?
Lexical sense representation
Compositional semantics
Applications of E-HowNet
Future research
What is E-HowNet?
Lexical sense representation
Compositional semantics
E-HowNet
E-HowNet is an entity-relation model
extended from HowNet for lexical semantic
representation.
A uniform semantic representation for function
words, content words and phrases.
Semantic relations are explicitly expressed in E-
HowNet representations.
Semantic composition and decomposition
capabilities.
Near-canonical sense representation.
E-HowNet- Sense
Representation
Word sense definition- decompose a sense into
simpler senses and sense relations are explicitly
expressed
果盤 fruit plate
def:{plate|盤:telic={put|放置: location={~},patient={fruit|
水果}}}
玻璃盤 glass plate
def: {plate|盤:material={glass|玻璃}}
圓盤 round plate
def: {plate|盤:shape={round|圓}}
E-HowNet- Sense
Representation
Uniform representation for function
words, content words and phrases
Preposition: 從
• def: location-source={},
• def: time-init={}
Conjunction: 因為
• def: reason={}
Adverb: 透頂
• def: degree={very|很}
Definitions of function words
Function words Content words
Relational senses ---------------------------------------- Content senses
De的, prepositions, …adjectives, verbs, nouns
Conjunctions, adverbs…
written.
{write|寫:goal={text|語文}, aspect={Vachieve|達
成}}
E-HowNet- Sense
Representation
High-level representations can be decomposed
into primitive representations.
E.g. ‘獅子狗Beijing dog’
def1: {狗|dog: source={北京|Beijing}}.
Extention result:
def2: {livestock|牲畜: telic={看家|MindTheHouse: agent={~}},
source={北京|Beijing}}.
def3: {livestock|牲畜: telic={TakeCare|照料:patient={family|家
庭}}:agent={~}}, source ={capital|國都: name={"北京"},
location={China|中國},
quantifier={definite|定指}}}.
E-HowNet- Sense
Representation
High-level representations can be decomposed
into primitive representations.
The primitives are adopted from HowNet, called
sememes 義原.
The set of primitives has about two thousand elements
and organized into taxonomy of entities and relations.
Taxonomy of E-HowNet
http://ckip.iis.sinica.edu.tw/taxonomy/
All|全
entity|事物
event|事件
state|狀態
act|行動
AttributeValue|屬性值
object|物體
thing|萬物
time|時間
space|空間
relation|關係
Semantic Role|語意角色
function|函數
Principles for word sense
definitions
Principles for sense definitions
Use hypernym and prominent
properties to define concepts.
Qualia structure- agentive, telic, formal,
and constitutive
Use well-defined/primitive concepts and
relations to define new concepts.
Principles for sense definitions
Agentive-the factors involved in the origin or
“bringing about” of the object
color weight
陳|old 酒|wine 快|fast 車|car
Time Speed
Agentive={produce|製造} Telic={move|移動}
Application of semantic types
Disambiguation of transitive Verb+Noun structure
verb objects modifier head
檢驗|inspect + Noun
Telic={produce|製造} Predication={擊發}
Note: objects are rarely to be suffix or
prefix of verbs.
Val1+Val2
Conjunctive structure
Synonym: Hypernym(Val1, Val2) e.g. 豐富
Antonym : attribute e.g. 好壞、興衰
Modifier-head structure
Val1=degree value = {大、深、寡、豔、...}
e.g. 大好、深紅
Verb result structure
Most results are states (values).
Val2={起來、下去…}
Act+Value
Act+ValueVR
{Act: result={Value}}
Value={P-state, M-state, E-state}
Problem: the transitivity of the result VR
compound?
Intransitive: 跑累、氣昏
Transitive: 哭瞎、打破
Value+Act
Modifier+headAct
{Act: manner={value}} or {Act: E-
attri={E-value}}
E.g. 靜思 def ={思想: manner={靜}}
快煮 def ={煮: speed={快}}
Note: there is a set of actions which are
always nominalized at suffix position.
(See next page)
Object or Value+Act (nominal
action)
Obj or Val+Action affairs
Action={存、收、考、行、吻、射、改、治
、防、…}
{affairs|事務: CoEvent={Act}}
Animal+{叫}
長、安、全壘+{打}
Obj+Value
Obj+Shape-Value(形狀量詞)
{Obj: shape={value}}
E.g. 串、粉、捲、圈、桿、棒、管、環、末
、條、塊、團、屑
Obj+color-value color-Value
米白、酒紅
Obj+odor-value odor Noun
香、臭、腥
Value+Obj
cf. HowNet definitions are very rough; for examples all dogs are
defined as: {livestock|牲畜}.
Advantages of E-HowNet
Achieves near canonical meaning
representation.
機長機敏地抓獲女搶犯 vs. 飛機駕駛員敏捷的逮捕女強盜
Syntactic parsing
Def:{抓獲:agent={機長},patient={搶:gender={女}},manner={機
敏}}
Def:{逮捕:agent={飛機正駕駛},patient={強
盜:gender={女}},manner={敏捷}}
E-HowNet Expansion →
Advantages of E-HowNet
Def: {catch|捉住:
agent={human|人:HostOf={Occupation|職位},modifier={official|官},
predication={manage|管理:agent={~},patient={aircraft|飛行器}}},
patient={human|人:modifier={guilty|有罪},predication={rob|搶:agent={~}},
gender={female|女}},
manner={clever|靈}}
Def: {catch|捉住:
agent={human|人:HostOf={Occupation|職位},modifier={official|官},
predication={manage|管理:agent={~},patient={aircraft|飛行器}}},
patient={human|人:modifier={guilty|有罪},predication={rob|搶:agent={~}},
gender={female|女}},
manner={nimble|捷}}
Advantages of E-HowNet
Multi-level representations
Tailor store|裁縫店
def: {store|店:telic={tailor|裁縫:place={~}}}
cf. HowNet def: {InstitutePlace|場所:{produce|製造:
PatientProduct={clothing|衣物},location={~}}}
Hownet concepts are defined by primitive concepts. For
instance, in the above example, the basic concept
InstitutePlace|場所 lost the information of “commerce” in
「店」.
Advantages of E-HowNet
E-HowNet is language independent.
E-HowNet uses WordNet synsets as description
language.
W_C=公佈欄
W_E=bulletin board
DEF={facilities|設施:{put|放
置:location={~},patient={text|語文:{announce|發
表:content={~}}}}}
DEF3={[(establishment)]:{[(put,set,place)]:location={~}
,patient={[(text,textual_matter)+(command_language,qu
ery_language,search_language)]:{[(announce,denote)]:c
ontent={~}}}}}
Advantages of E-HowNet
E-HowNet did not create a completely
new ontology, but accommodates other
ontologies.
Links are established between different
ontologies.
WordNet synsets will be an intermediate
language.
Outline
What is E-HowNet?
Lexical sense representation
Compositional semantics
Applications of E-HowNet
Future research
Difficulties and future research
Semantic representation
Domain specific concepts
Domain terms: 質數|prime number 、二氧化碳
|carbon dioxide …
Relative entities:他人|others、外野| out field …
Fine-grained features
Aspects and viewpoints
Difficulties and future research
Semantic composition
Word identification- word segmentation
and unknown word identification
Sentence parsing- syntactic structure
analysis and semantic role assignment
Word sense disambiguation
Meaning facet determination
Generic or instance
Difficulties and future research
Semantic composition
Anaphoric references
Fine-grained semantic relations and gaps
Construction meaning and metaphoric
inferences
View point normalization
Buy 買: Sell 賣
Borrow 借: Lend 借
Cause 因為: Result 所以