ONTOLOGY BASED WEB CRAWLER

SUBMITTED BY: Sachin Murwariya (9910103457)

JIIT; Project 2013-14; CSE/IT; project presentation

WHAT IS RSS

RSS is a defined standard for syndicating headlines and other content.

RSS is written in XML (eXtensible Markup Language), a markup language similar to HTML. All fields are defined, and tags denote each field’s classification.

Like HTML, proper construction requires that tags are both opened and closed.

Example: <title> Title of Item in Feed </title>

RSS has been around for more than a decade, but only recently has the standard been embraced by bloggers, webmasters and large news portals as a means of distributing information in a standardized format.
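
As a small illustration of how the opened-and-closed tags of a feed can be read programmatically, here is a minimal sketch using Python's standard xml.etree.ElementTree module. The sample feed, its URLs and the function name parse_rss_items are made up for this example and are not taken from the project.

```python
import xml.etree.ElementTree as ET

# A tiny, made-up RSS 2.0 document used only for illustration.
SAMPLE_FEED = """<?xml version="1.0"?>
<rss version="2.0">
  <channel>
    <title>Example News</title>
    <link>http://example.com/</link>
    <description>Sample feed</description>
    <item>
      <title> Title of Item in Feed </title>
      <link>http://example.com/item1</link>
      <description>Short summary of the first item.</description>
    </item>
    <item>
      <title>Second Item</title>
      <link>http://example.com/item2</link>
      <description>Short summary of the second item.</description>
    </item>
  </channel>
</rss>"""


def parse_rss_items(xml_text):
    """Return one dict per <item>, holding its title, link and description."""
    root = ET.fromstring(xml_text)
    items = []
    for item in root.iter("item"):
        items.append({
            # findtext returns None when a tag is missing, so fall back to "".
            "title": (item.findtext("title") or "").strip(),
            "link": (item.findtext("link") or "").strip(),
            "description": (item.findtext("description") or "").strip(),
        })
    return items


if __name__ == "__main__":
    for entry in parse_rss_items(SAMPLE_FEED):
        print(entry["title"], "->", entry["link"])
```

A feed whose tags are not properly closed makes ET.fromstring raise a ParseError, which is why well-formed construction matters.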

WHAT IS AN ONTOLOGY BASED WEB CRAWLER

We present News Personalization using the Semantic Recommender, a news recommender system which applies Semantic Web technologies to describe and relate news contents and user preferences in order to produce enhanced recommendations.

APPLICATIONS

◦ User profile construction

◦ Semantics-based recommendation

◦ Delivering categorised news items

BENEFITS:

Constant updates: helps users stay up to date.

Ease of operation: the user can collect information from multiple sources into a single data stream.
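
Collecting several sources into a single data stream can be sketched as follows. This is only an illustration under assumed data: the feed names, item fields and the helper merge_feeds are hypothetical, and each item is assumed to carry a publication timestamp.

```python
from datetime import datetime
from itertools import chain


def merge_feeds(*feeds):
    """Combine items from several feeds into one list, newest first."""
    all_items = chain.from_iterable(feeds)
    return sorted(all_items, key=lambda item: item["published"], reverse=True)


# Hypothetical items from two sources.
feed_a = [
    {"source": "feed_a", "title": "A1", "published": datetime(2014, 1, 3, 9, 0)},
    {"source": "feed_a", "title": "A2", "published": datetime(2014, 1, 1, 9, 0)},
]
feed_b = [
    {"source": "feed_b", "title": "B1", "published": datetime(2014, 1, 2, 9, 0)},
]

stream = merge_feeds(feed_a, feed_b)

if __name__ == "__main__":
    for item in stream:
        print(item["published"].date(), item["source"], item["title"])
```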

PROBLEM STATEMENT

The extremely large volume of online news has created an urgent need for tools that let users effectively and efficiently browse topics, detect temporal trends, and search news of interest.

To address this, we are building an ontology based web crawler to extract valuable information from large online news collections.

TEST PLAN

The purpose of testing is quality assurance, verification and validation, or reliability estimation.

Unit Testing

Component Testing

Integration Testing

Validation Testing

System Testing

ARCHITECTURE:

METHODS IN USE:

1. Crawling Algorithm

2. Concept Based Algorithm

3. Recommendation Algorithm

CRAWLING ALGORITHM:
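
As a rough illustration of what an ontology-guided crawl might look like (a generic sketch, not the project's actual algorithm), the code below runs a breadth-first crawl over a small in-memory set of pages and only follows links from pages that mention a term of some ontology concept. The ontology, the page data and the names matched_concepts and crawl are all hypothetical.

```python
from collections import deque

# Hypothetical ontology: concept -> terms that signal the concept.
ONTOLOGY = {
    "sports": {"cricket", "football", "match"},
    "finance": {"stock", "market", "bank"},
}

# Hypothetical in-memory "web": url -> (page text, outgoing links).
PAGES = {
    "news/home": ("latest news headlines", ["news/cricket", "news/weather"]),
    "news/cricket": ("cricket match report", ["news/football"]),
    "news/football": ("football match preview", []),
    "news/weather": ("rain expected tomorrow", ["news/stock"]),
    "news/stock": ("stock market rises", []),
}


def matched_concepts(text, ontology):
    """Return the set of ontology concepts whose terms occur in the text."""
    words = set(text.lower().split())
    return {concept for concept, terms in ontology.items() if words & terms}


def crawl(seed, pages, ontology, max_pages=10):
    """Breadth-first crawl from seed; return {url: concepts} for relevant pages."""
    queue = deque([seed])
    seen = {seed}
    relevant = {}
    visited = 0
    while queue and visited < max_pages:
        url = queue.popleft()
        visited += 1
        text, links = pages.get(url, ("", []))
        concepts = matched_concepts(text, ontology)
        if concepts:
            relevant[url] = concepts
        # Expand the seed unconditionally, other pages only if relevant.
        if concepts or url == seed:
            for link in links:
                if link not in seen:
                    seen.add(link)
                    queue.append(link)
    return relevant


if __name__ == "__main__":
    for url, concepts in crawl("news/home", PAGES, ONTOLOGY).items():
        print(url, sorted(concepts))
```

In this toy data the stock page is never reached, because it is linked only from a page that matches no concept. That pruning is the trade-off of a focused crawl: fewer pages fetched, at the risk of missing relevant pages behind irrelevant ones.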

CONCEPT BASED ALGORITHM

RECOMMENDATION ALGORITHM

Recommender systems typically produce a list of recommendations in one of two ways: through collaborative or content-based filtering.

Collaborative filtering approaches build a model from a user's past behavior (items previously purchased or selected), then use that model to predict items that the user may have an interest in.

Content-based filtering approaches utilize a series of discrete characteristics of an item in order to recommend additional items with similar properties.
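
To make the content-based case concrete, here is a minimal sketch that describes each news item by a set of discrete characteristics and ranks unseen items by Jaccard similarity to the items a user has already selected. The item data, the choice of Jaccard similarity and the names jaccard and recommend are assumptions for illustration, not the method used in the project.

```python
# Hypothetical news items, each described by a set of discrete characteristics.
ITEMS = {
    "n1": {"sports", "cricket", "india"},
    "n2": {"sports", "football", "europe"},
    "n3": {"finance", "stock", "india"},
    "n4": {"sports", "cricket", "australia"},
}


def jaccard(a, b):
    """Jaccard similarity of two sets: |a & b| / |a | b| (0.0 if both empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def recommend(selected_ids, items, top_n=2):
    """Rank items the user has not selected by similarity to their profile."""
    # The profile is the union of characteristics of previously selected items.
    profile = set().union(*(items[i] for i in selected_ids))
    scored = [
        (jaccard(profile, features), item_id)
        for item_id, features in items.items()
        if item_id not in selected_ids
    ]
    # Highest score first; ties are broken by item id for a stable order.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [item_id for _, item_id in scored[:top_n]]


if __name__ == "__main__":
    print(recommend(["n1"], ITEMS))
```

A collaborative approach would instead compare the selection histories of different users, so it needs no item characteristics but does need data from many users.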

IMPLEMENTATION

Login Page

Search using keyword:

THANK YOU