Languages use different systems for classifying nouns. Gender languages assign many — sometimes all — nouns to distinct sex-based categories, masculine and feminine. We construct a new data set, documenting this property for more than four thousand languages which together account for more than 99 percent of the world’s population.