Analyzing URL Chatter on Twitter
From Knoesis wiki
Project Description
Objectives
- Classify the content of the URLs shared in tweets
- Understand how users perceive the websites those URLs point to
Motivation
- Search Engine Perspective - Given the query keywords, how to choose a page that is interesting to the user
- Publisher Perspective - What people think about the page (URL)
Status
Week 1
- Extracting the URLs from the tweets.
- Recognizing short/tiny URLs and transforming them into their long URLs.
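The two Week 1 steps could be sketched roughly as follows. This is only an illustration, not the project's actual code: the regular expression, the function names `extract_urls` and `is_short_url`, and the `SHORTENER_HOSTS` list are all assumptions made for the example, and the host list is deliberately incomplete.

```python
import re
from urllib.parse import urlparse

# Assumed, incomplete list of URL-shortener hosts; extend it as needed.
SHORTENER_HOSTS = {"bit.ly", "tinyurl.com", "t.co", "goo.gl", "ow.ly", "is.gd"}

# Simple pattern for http/https URLs; it stops at whitespace, quotes, or angle brackets.
URL_PATTERN = re.compile(r"https?://[^\s<>\"']+", re.IGNORECASE)


def extract_urls(tweet_text):
    """Return the URLs found in a tweet, with trailing punctuation stripped."""
    return [match.rstrip(".,;:!?)]}") for match in URL_PATTERN.findall(tweet_text)]


def is_short_url(url):
    """Return True if the URL's host is one of the known shortener hosts."""
    host = (urlparse(url).hostname or "").lower()
    if host.startswith("www."):
        host = host[4:]
    return host in SHORTENER_HOSTS


if __name__ == "__main__":
    tweet = "Great read http://bit.ly/abc123, also see https://example.com/page."
    for url in extract_urls(tweet):
        print(url, "short" if is_short_url(url) else "long")
```

A host list is the simplest way to recognize short URLs, but it misses shorteners that are not listed. Checking whether the URL answers with an HTTP redirect is a more general alternative.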
Week 2
- Creating a table that stores the URLs together with the tweets that mention them
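One possible layout for this storage is sketched below using SQLite. The page does not say which database or schema the project uses, so the table names (`tweets`, `urls`, `tweet_urls`), the columns, and the helper functions are all hypothetical. The sketch splits the data into three tables so that a URL mentioned in many tweets is stored only once.

```python
import sqlite3

# Hypothetical schema: one row per tweet, one row per distinct long URL,
# and a link table recording which tweet mentioned which URL.
SCHEMA = """
CREATE TABLE IF NOT EXISTS tweets (
    tweet_id INTEGER PRIMARY KEY,
    text     TEXT NOT NULL
);
CREATE TABLE IF NOT EXISTS urls (
    url_id   INTEGER PRIMARY KEY AUTOINCREMENT,
    long_url TEXT NOT NULL UNIQUE
);
CREATE TABLE IF NOT EXISTS tweet_urls (
    tweet_id  INTEGER NOT NULL REFERENCES tweets(tweet_id),
    url_id    INTEGER NOT NULL REFERENCES urls(url_id),
    short_url TEXT,
    PRIMARY KEY (tweet_id, url_id)
);
"""


def create_store(path=":memory:"):
    """Open (or create) the database and make sure the tables exist."""
    conn = sqlite3.connect(path)
    conn.executescript(SCHEMA)
    return conn


def store_tweet_url(conn, tweet_id, text, long_url, short_url=None):
    """Record that a tweet mentions a URL; repeated calls are harmless."""
    with conn:  # commits on success, rolls back on error
        conn.execute(
            "INSERT OR IGNORE INTO tweets (tweet_id, text) VALUES (?, ?)",
            (tweet_id, text),
        )
        conn.execute("INSERT OR IGNORE INTO urls (long_url) VALUES (?)", (long_url,))
        url_id = conn.execute(
            "SELECT url_id FROM urls WHERE long_url = ?", (long_url,)
        ).fetchone()[0]
        conn.execute(
            "INSERT OR IGNORE INTO tweet_urls (tweet_id, url_id, short_url) "
            "VALUES (?, ?, ?)",
            (tweet_id, url_id, short_url),
        )


def tweets_for_url(conn, long_url):
    """Return the text of every stored tweet that mentions the given long URL."""
    rows = conn.execute(
        "SELECT t.text FROM tweets t "
        "JOIN tweet_urls tu ON tu.tweet_id = t.tweet_id "
        "JOIN urls u ON u.url_id = tu.url_id "
        "WHERE u.long_url = ? ORDER BY t.tweet_id",
        (long_url,),
    ).fetchall()
    return [row[0] for row in rows]
```

Keying the `urls` table on the long URL means that tweets using different short links to the same page are grouped under one entry, which is what the later classification and perception analysis would need.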
Future work
1. Make sure the expanded URL is not itself a short URL by checking it recursively.
2. Extract themes and entities from the tweets themselves rather than relying on the currently available themes.
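The recursive check in item 1 might look like the sketch below, under stated assumptions. The names `expand_url`, `fetch_redirect_target`, `is_short_url`, and `SHORTENER_HOSTS` are hypothetical, the shortener list is incomplete, and the hop limit of 10 is an arbitrary safeguard. The network lookup is passed in as a parameter so the recursion can be exercised without real HTTP requests.

```python
import urllib.error
import urllib.request
from urllib.parse import urljoin, urlparse

# Assumed, incomplete list of URL-shortener hosts.
SHORTENER_HOSTS = {"bit.ly", "tinyurl.com", "t.co", "goo.gl", "ow.ly", "is.gd"}


def is_short_url(url):
    """Return True if the URL's host is one of the known shortener hosts."""
    host = (urlparse(url).hostname or "").lower()
    if host.startswith("www."):
        host = host[4:]
    return host in SHORTENER_HOSTS


class _NoRedirect(urllib.request.HTTPRedirectHandler):
    """Stop urllib from following redirects so each hop can be inspected."""

    def redirect_request(self, req, fp, code, msg, headers, newurl):
        return None


def fetch_redirect_target(url, timeout=10):
    """Follow exactly one hop: return the redirect target, or None if there is none."""
    opener = urllib.request.build_opener(_NoRedirect)
    request = urllib.request.Request(url, method="HEAD")
    try:
        with opener.open(request, timeout=timeout):
            return None  # a normal response means no redirect
    except urllib.error.HTTPError as err:
        if err.code in (301, 302, 303, 307, 308):
            location = err.headers.get("Location")
            return urljoin(url, location) if location else None
        raise


def expand_url(url, resolve_hop=fetch_redirect_target, max_hops=10, _seen=None):
    """Recursively expand a URL until it is no longer a short URL."""
    seen = _seen if _seen is not None else set()
    # Stop on a long URL, a redirect loop, or when the hop budget is used up.
    if max_hops <= 0 or url in seen or not is_short_url(url):
        return url
    seen.add(url)
    target = resolve_hop(url)
    if target is None:
        return url
    return expand_url(target, resolve_hop, max_hops - 1, seen)
```

For example, if `http://bit.ly/a` redirects to `http://tinyurl.com/b`, which in turn redirects to a page on an ordinary host, `expand_url("http://bit.ly/a")` would return that final page rather than the intermediate short link.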