Universal Approximation Theorem for Neural Networks

I have been reading background on this theorem for a class I will be teaching at work. Having no background in functional analysis and very limited exposure to topology, I was left scratching my head by the terminology and logic of George Cybenko's 1989 proof. I found, however, a very helpful gloss on the proof by Daniel McNeela.

