Natural Language Processing: How to capture semantic similarity in texts