Final Presentation Tong Wang

Final Presentation

Tong Wang

1. Automatic Article Screening in Systematic Review

2. Compression Algorithm on Document Classification

Automatic Article Screening

• Review question: Vitamin C for preventing and treating the common cold?

• Data set: 17 reference articles and 664 non-reference articles.

Problem Definition

• Input: a document d and the classes (c1 = reference, c2 = not a reference)

• Output: the predicted class of d

• Goal: find all articles that belong to c1 (reference)

Build Features

• “Bag of Words” assumption: the order of words in a document can be neglected

• Preprocessing: tokenization, lemmatization, removing stop words, and removing selected parts of speech.

• An additional step is needed: a Named Entity Recognizer (NER), which labels sequences of words that are names of things. It is implemented with a linear-chain Conditional Random Field (CRF).
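A minimal sketch of the tokenization and stop-word steps, using only the Python standard library. The `STOP_WORDS` set and the `preprocess` helper are illustrative assumptions, not the presenter's code; lemmatization, part-of-speech filtering, and NER are left out because they would require an NLP toolkit.

```python
import re

# Tiny illustrative stop-word list (an assumption; real lists are much longer).
STOP_WORDS = {"a", "an", "and", "are", "for", "in", "is", "of", "the", "to"}

def preprocess(text):
    """Lowercase the text, split it into runs of letters, and drop stop words."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return [t for t in tokens if t not in STOP_WORDS]

tokens = preprocess("Vitamin C for preventing and treating the common cold")
print(tokens)
```

The resulting token list is what the bag-of-words representation is built from, since word order no longer matters after this step.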

Build features

• Vector space model

• Extract the vocabulary over all articles.

• Each document can be represented by a vector; the value in each dimension is the word frequency in this article.

• N = size of vocabulary

      w1  w2  w3  w4  …  wN
d1    1   0   2   0   …  0
d2    0   1   0   0   …  0
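The vector space model can be sketched in a few lines of Python. The helper names `build_vocabulary` and `to_vector` and the toy documents are assumptions for illustration; the input is assumed to be already tokenized.

```python
from collections import Counter

def build_vocabulary(docs):
    """Sorted list of every distinct token across all tokenized documents."""
    return sorted({tok for doc in docs for tok in doc})

def to_vector(doc, vocabulary):
    """Word-frequency vector for one document, in vocabulary order."""
    counts = Counter(doc)
    return [counts[w] for w in vocabulary]

# Two toy tokenized documents (hypothetical data).
docs = [
    ["vitamin", "cold", "vitamin", "trial"],
    ["cold", "placebo"],
]
vocab = build_vocabulary(docs)
vectors = [to_vector(d, vocab) for d in docs]
print(vocab)
print(vectors)
```

Each vector has length N, the vocabulary size, so every document lands in the same N-dimensional space regardless of its length.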

Naïve Bayes

Logistic Regression

Discuss

• Define a loss matrix that gives a high penalty to false negatives.

• Another approach is to use cosine similarity to compare articles. Wikipedia definition: cos(θ) = (A · B) / (‖A‖ ‖B‖)

• Use other NLP models, such as LSA or LDA.
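The cosine measure mentioned above can be sketched as follows, assuming each article is already a word-frequency vector of equal length. The function name `cosine_similarity` is an illustrative choice, and returning 0.0 for an all-zero vector is an assumption made to avoid division by zero.

```python
import math

def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length frequency vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0  # assumption: an empty document is similar to nothing
    return dot / (norm_u * norm_v)

d1 = [1, 0, 2, 0]
d2 = [0, 1, 0, 0]
print(cosine_similarity(d1, d1))
print(cosine_similarity(d1, d2))
```

Documents that share no words get a similarity of 0, while a document compared with itself gets a value of 1 up to rounding.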

Compression

• The basic idea is that data containing patterns that occur with a certain regularity will be compressed more efficiently.

• It is generally inexpensive

• d(x, y) = c(xy) / (c(x) + c(y))

• x: a document

• c(x): size of the compressed file x

• xy: the file obtained by concatenating x and y

• d(x, y) - 1/2 >= 0
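A small sketch of d(x, y) in Python, using `zlib` as the compressor. The choice of zlib, the function names, and the toy texts are assumptions; the slides do not say which compressor was used.

```python
import zlib

def compressed_size(data: bytes) -> int:
    """c(x): number of bytes after zlib compression at the highest level."""
    return len(zlib.compress(data, 9))

def compression_distance(x: bytes, y: bytes) -> float:
    """d(x, y) = c(xy) / (c(x) + c(y)), where xy is the concatenation of x and y."""
    return compressed_size(x + y) / (compressed_size(x) + compressed_size(y))

# Hypothetical toy documents: two on the same topic, one on another topic.
x = b"vitamin c and the common cold " * 50
z = b"gradient descent minimizes a loss function " * 50

print(compression_distance(x, x))
print(compression_distance(x, z))
```

When y repeats patterns already present in x, the concatenation compresses almost as well as x alone, so the ratio stays close to its lower end; unrelated content pushes it toward 1.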

[Figure: compressed sizes C(x), C(y), and C(xy)]

Compression Matrix

      a1          a2          a3   a4   …
b1    d(b1, a1)   d(b1, a2)
b2    d(b2, a1)   d(b2, a2)
b3
b4
…
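One way to fill such a matrix, sketched in Python with `zlib` standing in for the compressor (an assumption). The function names and the toy documents are hypothetical; rows follow the b documents and columns follow the a documents.

```python
import zlib

def compression_distance(x: bytes, y: bytes) -> float:
    """d(x, y) = c(xy) / (c(x) + c(y)), with c measured by zlib."""
    def size(data: bytes) -> int:
        return len(zlib.compress(data))
    return size(x + y) / (size(x) + size(y))

def compression_matrix(group_a, group_b):
    """Entry [i][j] is d(b_i, a_j): one row per b document, one column per a document."""
    return [[compression_distance(b, a) for a in group_a] for b in group_b]

# Hypothetical toy groups of documents.
group_a = [
    b"vitamin c and the common cold " * 50,
    b"gradient descent minimizes a loss function " * 50,
]
group_b = [b"vitamin c and the common cold " * 50]

matrix = compression_matrix(group_a, group_b)
for row in matrix:
    print([round(value, 3) for value in row])
```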

Experiments

• Two groups of drug review (ADHD) articles.

• Two groups of machine learning articles.

• Each group has 15 articles.

• Intuitively:
  d(ADHD, ADHD) < d(ADHD, machine learning)
  d(machine learning, machine learning) < d(ADHD, machine learning)

Future work

• More experiments

• Compare cosine(x, y) and d(x, y)

Compression: data files compression, music compression, image and video compression
