Well since it is only a text input AI it could only possibly attempt to do the VIQ part of a Weschler style IQ test, since the PIQ part requires understanding image abstractions (arrangements, block design, matrices of sequences etc).
I know there were some deep learning papers on how to train a model to pass the PIQ portion without human-coded heuristics (because, you could easily write a program to solve such questions if you knew ahead of time the format of the questions). I don't remember the outcomes however.