It might be this:
LeCun, Y., Bottou, L., Bengio, Y., & Haffner, P. (1998). Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86(11), 2278-2324.
This paper introduced convolutional (weight-sharing) networks - now popularly known as Deep Neural Networks - and showed they could be used in real-world problems. Cited 24,100 times, according to Google Scholar (2020-01-29) - over 1,000 citations per year on average.
Oh and - psychologists take note - published in conference proceedings.