Ve, a linguistic framework

Preview:

Citation preview

Ve

@KIMTARO

JISHO.ORG

LINGUISTICFRAMEWORK

FOR YOU

HOLY SHITWHY AREN’T WE US ING

MORE LINGUISTICS?!?!?!=!??!=!?!111one

USES?

SEARCH

!ies

!ies!y

SENTENCES

The U.S.A. Ambassadorto S. Korea.

The U.S.A. Ambassadorto S. Korea.

WORDS

ビールを飲む

ビール を 飲む

TRANSLITERATIONS

酒さけ

TAGGING

“I like beer„

“I like beer„

beer

PROBLEMS:(

STEMMING

!ies

!ies

!i

!ies

!i

LANGAUGE

HAS

EDGE CASES

IRREGULAR

ITIES

ENGLISH ONLY

PARSERS

ALL. THE.THINGS.

FUCK YEAH

CRASH

BUGGY

MEMORY

I N P U TOUTPUT

CHARACTERENCODINGS

DESTRUCTIVE

CASINGWHITESPACE

PUNCTUATION

LINGUISTSONLY

DIFFERENTPART OF

SPEECH SETS

BROWN CORPUS: 230IPADIC: 120

飲ん�����動詞だ ����助動詞

MANAGESPARSERS

ONE APITO RULE

THEM ALL

MULTIPLELANGUAGES

UNIFIEDPART OF

SPEECH SET

18

LEMMA

!ies

!y

♥YOUR DATA

LOWLEVEL

ACCESS

PARSERS EN EN EN JA JA SW CH

VE SERVER

PROVIDERS

MANAGERS EN JA SW CH

INTERFACES RUBY local JSRUBY remote

SERVER

RUBY gem JS libVE CLIENTS

gem "ve", '0.0.2', :git => 'git://github.com/Kimtaro/ve.git'

words = Ve.in(:ja).words(input)

words.each do |word| word.word word.part_of_speech.name word.extra[:reading].katakana_to_hiraganaend

#<Ve::Word:0x8edfe24@word="melons",@lemma="melon",@part_of_speech=Ve::PartOfSpeech::Noun,@tokens=[{ :raw=>"melons melon NNS 1", :type=>:parsed, :literal=>"melons", :lemma=>"melon", :pos=>"NNS", :accuracy=>"1", :characters=>7..12}],@extra={:grammar=>:plural},@info={}>

#<Ve::Word:0x8edfe24@word="melons",@lemma="melon",@part_of_speech=Ve::PartOfSpeech::Noun,@tokens=[{ :raw=>"melons melon NNS 1", :type=>:parsed, :literal=>"melons", :lemma=>"melon", :pos=>"NNS", :accuracy=>"1", :characters=>7..12}],@extra={:grammar=>:plural},@info={}>

#<Ve::Word:0x8edfe24@word="melons",@lemma="melon",@part_of_speech=Ve::PartOfSpeech::Noun,@tokens=[{ :raw=>"melons melon NNS 1", :type=>:parsed, :literal=>"melons", :lemma=>"melon", :pos=>"NNS", :accuracy=>"1", :characters=>7..12}],@extra={:grammar=>:plural},@info={}>

#<Ve::Word:0x8edfe24@word="melons",@lemma="melon",@part_of_speech=Ve::PartOfSpeech::Noun,@tokens=[{ :raw=>"melons melon NNS 1", :type=>:parsed, :literal=>"melons", :lemma=>"melon", :pos=>"NNS", :accuracy=>"1", :characters=>7..12}],@extra={:grammar=>:plural},@info={}>

Ve.in(:ja).sentences(input)

Ve.in(:ja).transliterate_from_hira_to_latn(input)

<script src="ve.js"></script><script> Ve('en').words('I like beer', function(words) { $.each(words, function(i, w) { w.lemma; w.part_of_speech; }); });</script>

github.com/Kimtaro/ve

Recommended