Semitic Morphological Analysis and Generation Using Finite State Transducers with Feature Structures

Michael Gasser

EACL 2009
Athens
1 April, 2009

Goals

Tigrinya verb morphology 1

Tigrinya verb morphology 2

Tigrinya verb morphology: a maximal example

Figure: Tigrinya verb

Representing the structure of a verb

'abzəytɨr_axəbalun ↔ `[root='rkb', sbj=[-p1,-p2,+plr,+fem,-prep], obj=[-p1,-p2,-plr,-fem,+prep], +neg, +rel, prep='ab'-, suf_conj=-'n']`
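
As a rough illustration of the representation above, the feature structure can be written as a nested mapping. This is only a sketch: the dict encoding, the `verb_fs` name, and the `render` helper are assumptions made for this example, not part of the system described in the talk.

```python
# Hypothetical encoding of the slide's feature structure as a nested dict.
# Binary features (+f / -f) become True / False; sub-structures become dicts.
verb_fs = {
    "root": "rkb",
    "sbj": {"p1": False, "p2": False, "plr": True, "fem": True, "prep": False},
    "obj": {"p1": False, "p2": False, "plr": False, "fem": False, "prep": True},
    "neg": True,
    "rel": True,
    # The slide writes these as 'ab'- and -'n'; here the hyphen is kept
    # inside the string to mark the side on which the affix attaches.
    "prep": "ab-",
    "suf_conj": "-n",
}


def render(fs):
    """Render a nested dict back into bracketed +/- notation."""
    parts = []
    for name, value in fs.items():
        if value is True:
            parts.append("+" + name)
        elif value is False:
            parts.append("-" + name)
        elif isinstance(value, dict):
            parts.append(name + "=" + render(value))
        else:
            parts.append(name + "=" + repr(value))
    return "[" + ",".join(parts) + "]"


print(render(verb_fs["sbj"]))  # [-p1,-p2,+plr,+fem,-prep]
```

In this encoding, analysis maps the surface form to such a structure and generation maps the structure back to the surface form.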

Finite state morphology

The components of a complete system

The composed FST

Figure: composition
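
To make the idea of a composed FST concrete, here is a minimal sketch of composition for epsilon-free transducers using the standard product construction. The dict-based transducer format, the function names, and the toy alphabets are all assumptions for illustration; they are not the talk's implementation and the symbols are not Tigrinya data.

```python
from collections import deque

# A transducer is a dict: {"start": state, "finals": set, "arcs": {state: [(inp, out, next)]}}.


def compose(a, b):
    """Product construction: pair an arc x:y of `a` with an arc y:z of `b` to get x:z."""
    start = (a["start"], b["start"])
    arcs, finals = {}, set()
    queue, seen = deque([start]), {start}
    while queue:
        state = queue.popleft()
        p, q = state
        if p in a["finals"] and q in b["finals"]:
            finals.add(state)
        for x, y, p2 in a["arcs"].get(p, []):
            for y2, z, q2 in b["arcs"].get(q, []):
                if y == y2:  # output of `a` must match input of `b`
                    nxt = (p2, q2)
                    arcs.setdefault(state, []).append((x, z, nxt))
                    if nxt not in seen:
                        seen.add(nxt)
                        queue.append(nxt)
    return {"start": start, "finals": finals, "arcs": arcs}


def transduce(fst, symbols):
    """Return the set of output strings the transducer assigns to `symbols`."""
    results = set()
    stack = [(fst["start"], 0, "")]
    while stack:
        state, i, out = stack.pop()
        if i == len(symbols):
            if state in fst["finals"]:
                results.add(out)
            continue
        for x, y, nxt in fst["arcs"].get(state, []):
            if x == symbols[i]:
                stack.append((nxt, i + 1, out + y))
    return results


# Toy single-state transducers: `first` rewrites a -> b, `second` rewrites b -> d.
first = {"start": 0, "finals": {0}, "arcs": {0: [("a", "b", 0), ("c", "c", 0)]}}
second = {"start": 0, "finals": {0}, "arcs": {0: [("b", "d", 0), ("c", "c", 0)]}}
both = compose(first, second)
print(transduce(both, "aca"))  # {'dcd'}
```

The composed machine does in one pass what running `first` and then `second` would do in two, which is the motivation for composing the components of a morphological system into a single FST.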

The problem of non-concatenative morphology

Non-concatenative example 1

Imperfective stem, three voice categories, CC_C root

Figure: CC_C templates (frames 0-2)

Non-concatenative example 2

Negation, relativization dependencies

Figure: rel neg (frames 0-2)

FSTs "weighted" with feature structures (Mohri, Amtrup)
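
One way to picture FSTs "weighted" with feature structures is to let each arc carry a partial feature structure and to combine the weights along a path by unification, with a failed unification ruling the path out. The sketch below assumes that reading; the dict encoding, the `FAIL` marker, and the example arcs are hypothetical and chosen only to show how a long-distance dependency between a prefix and a suffix could be enforced.

```python
FAIL = None  # marker for a failed unification (an impossible path)


def unify(a, b):
    """Unify two feature structures given as nested dicts; return FAIL on a clash."""
    result = dict(a)
    for name, value in b.items():
        if name not in result:
            result[name] = value
        elif isinstance(result[name], dict) and isinstance(value, dict):
            sub = unify(result[name], value)
            if sub is FAIL:
                return FAIL
            result[name] = sub
        elif result[name] != value:
            return FAIL
    return result


def path_weight(arc_weights):
    """Combine the weights of the arcs on a path, left to right."""
    acc = {}
    for weight in arc_weights:
        acc = unify(acc, weight)
        if acc is FAIL:
            return FAIL
    return acc


# Hypothetical arcs: a prefix arc sets +neg, stem arcs add subject features,
# and a suffix arc is only compatible with either +neg or -neg.
prefix = {"neg": True}
stem = {"sbj": {"plr": True}}
more_stem = {"sbj": {"fem": True}}
suffix_ok = {"neg": True}
suffix_bad = {"neg": False}

print(path_weight([prefix, stem, more_stem, suffix_ok]))
# {'neg': True, 'sbj': {'plr': True, 'fem': True}}
print(path_weight([prefix, stem, more_stem, suffix_bad]))
# None
```

Because the constraint travels in the weight rather than in the state, the network does not need a separate copy of the stem states for each value of `neg`.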

Weighted FSTs for Tigrinya verbs: affix dependencies: analysis 1

Figure: neg (frames 0-4)

Weighted FSTs for Tigrinya verbs: affix dependencies: analysis 2

Figure: neg rel (frames 0-4)

Weighted FSTs for Tigrinya verbs: stem: analysis

Figure: cc_c (frames 0-6)

Weighted FSTs for Tigrinya verbs: stem: generation

Figure: cc_c gen (frames 0-7)

Tigrinya verb morphology FSTs: architecture

Figure: arch guess

Evaluation