ELSA: Hardware-Software Co-design for Efficient, Lightweight Self-Attention Mechanism in Neural Networks | IEEE Conference Publication | IEEE Xplore