Reposting some of key lines from article
Snippet #1
A supervised approach means having a training dataset that is used to extract features from and create a model. Item-wise regression means that the model will give us a numeric score for each answer that we can use to rank them.
Snippet #2
Features Rating
At Quora we define good answers to have the following five properties:
Analysis #1
- Answers the question that was asked.
- Provides knowledge that is reusable by anyone interested in the question.
- Answers that are supported with rationale.
- Demonstrates credibility and is factually correct.
- Is clear and easy to read.
My understanding on their approach
- Keywords would be extracted from the question to identify features that it talks about. These keywords may be used to weight the answers or relate to answers
- Number of Views
- Number of Upvotes
- Context (Topic)
- Number of Comments
- Score based on words(features) used
- Assign a Overall score
- For answers related to same topic matching keywords, Match it to existing features and compare with it to provide comparative ranking.
- I suppose they may do OCR as well to extract text from images.
- Translating Quora answers into any other language might need substantial re-work and building corpus in the target language
Happy Learning!!!
No comments:
Post a Comment