Synapse at CAp 2017 NER challenge: Fasttext CRF

09/14/2017
by Damien Sileo, et al.

We present our system for the CAp 2017 NER challenge, which addresses named entity recognition on French tweets. Our system uses unsupervised learning on a larger dataset of French tweets to learn features that feed a CRF model. It ranked first without using any gazetteer or structured external data, with an F-measure of 58.89%. To the best of our knowledge, it is the first system to use fastText embeddings (which include subword representations) and an embedding-based sentence representation for NER.
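
As a rough illustration of the idea (not the authors' implementation), the sketch below shows how subword-aware word vectors and an averaged sentence vector could be turned into per-token features for a CRF. Everything here is an assumption made for the example: the function names, the 3-to-6 character n-gram range, the 8-dimensional vectors, and above all `toy_vector`, which derives a deterministic pseudo-embedding from a hash where a real system would look up embeddings trained on unlabeled tweets.

```python
import hashlib

DIM = 8  # toy dimensionality, chosen only for this example


def char_ngrams(word, n_min=3, n_max=6):
    """Character n-grams of a word wrapped in < > boundary markers."""
    marked = f"<{word}>"
    return [
        marked[i:i + n]
        for n in range(n_min, n_max + 1)
        for i in range(len(marked) - n + 1)
    ]


def toy_vector(unit, dim=DIM):
    """Hypothetical stand-in for a trained embedding lookup.

    Maps a string to a fixed pseudo-random vector via a hash, so the
    example runs without any trained model.
    """
    digest = hashlib.md5(unit.encode("utf-8")).digest()
    return [(b / 255.0) - 0.5 for b in digest[:dim]]


def word_vector(word):
    """Average the vectors of the whole word and its character n-grams."""
    units = sorted(set(char_ngrams(word)) | {f"<{word}>"})
    vectors = [toy_vector(u) for u in units]
    return [sum(col) / len(vectors) for col in zip(*vectors)]


def sentence_vector(tokens):
    """Embedding-based sentence representation: mean of the word vectors."""
    vectors = [word_vector(t) for t in tokens]
    return [sum(col) / len(vectors) for col in zip(*vectors)]


def token_features(tokens, i):
    """Feature dictionary for token i: its own vector plus the sentence vector."""
    feats = {"bias": 1.0}
    for d, v in enumerate(word_vector(tokens[i])):
        feats[f"w{d}"] = v
    for d, v in enumerate(sentence_vector(tokens)):
        feats[f"s{d}"] = v
    return feats


tweet = ["Je", "visite", "Paris"]
features = [token_features(tweet, i) for i in range(len(tweet))]
```

Because word vectors are built from character n-grams, a misspelled or unseen token still receives a vector, which is useful on noisy tweet text. A list of such per-token dictionaries is the kind of input that CRF toolkits accepting real-valued features typically take.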
