vector-based language identification
%s