Gender Inference using Statistical Name Characteristics in Twitter

06/17/2016
by Juergen Mueller, et al.

Gender inference for Twitter users has received much attention. Although names are strong gender indicators, the names of Twitter users are rarely used as a feature, probably because many of them are ill-formed and cannot be found in any name dictionary. Instead of relying solely on a name database, we propose a novel name classifier. Our approach extracts characteristics from the user names and uses them to assign each name to a gender. This enables us to classify international first names as well as ill-formed names.
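
To illustrate the general idea of classifying a name from its characteristics rather than from a dictionary lookup, here is a minimal sketch. The abstract does not say which characteristics or which classifier the authors use, so everything here is an assumption made for illustration: the features (character bigrams plus the final letter), the naive Bayes model with add-one smoothing, the names `name_features` and `NameGenderClassifier`, and the tiny toy training list.

```python
import math
from collections import Counter, defaultdict


def name_features(name, n=2):
    """Character-level features for a (possibly ill-formed) name.

    Assumption: character n-grams and the last letter stand in for the
    paper's "name characteristics", which the abstract does not specify.
    """
    # Keep letters only, so "Annna_93" and "annna" yield the same features.
    cleaned = "".join(ch for ch in name.lower() if ch.isalpha())
    if not cleaned:
        return []
    padded = "^" + cleaned + "$"  # mark the start and end of the name
    feats = ["ng=" + padded[i:i + n] for i in range(len(padded) - n + 1)]
    feats.append("last=" + cleaned[-1])
    return feats


class NameGenderClassifier:
    """Multinomial naive Bayes with add-one smoothing (illustrative only)."""

    def fit(self, names, labels):
        self.label_counts = Counter(labels)
        self.feat_counts = defaultdict(Counter)
        self.vocab = set()
        for name, label in zip(names, labels):
            for feat in name_features(name):
                self.feat_counts[label][feat] += 1
                self.vocab.add(feat)
        return self

    def predict(self, name):
        feats = name_features(name)
        total = sum(self.label_counts.values())
        best_label, best_score = None, -math.inf
        for label, count in self.label_counts.items():
            # Log prior plus smoothed log likelihood of each feature.
            score = math.log(count / total)
            denom = sum(self.feat_counts[label].values()) + len(self.vocab)
            for feat in feats:
                score += math.log((self.feat_counts[label][feat] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Toy training data, made up for this sketch (not from the paper).
toy_names = ["anna", "maria", "julia", "laura",
             "peter", "markus", "thomas", "jonas"]
toy_labels = ["f", "f", "f", "f", "m", "m", "m", "m"]

clf = NameGenderClassifier().fit(toy_names, toy_labels)
print(clf.predict("Annna_93"))  # misspelled name with digits
print(clf.predict("jonass"))    # misspelled name
```

Because the features are computed from the characters of the name itself, a misspelled or decorated name that appears in no dictionary still shares most of its features with well-formed names, which is the property the abstract highlights.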
