4The problem of language modeling is essentially density estimation for text data.