That wouldn't be too strange because in many languages including English (though I typically use singular they) the male pronoun is also the ungendered pronoun.
Edit: looked it up to confirm my suspicion; English alai traditionally uses he for the ambiguous pronoun.
I thought that the Google Translate corpus consisted of largely documents that were translated between multiple languages. In that case there was probably context telling the translator the gender, or they were translated from English with a gender to gender-neutral.
In this case I would expect that barring any other influence these are just whatever gender happened to be the most common for each verb in the input texts.
How would I feel if these were all the male pronoun he. I think I would feel odd, but not as odd. Unsure what that says
How did we get here. Is this yet another case of ML picking up on bias in seed data?