Jurafsky and Martin (to appear) Speech and Language Processing (3rd ed. draft)
Goodfellow, Bengio and Courville (2016) Deep Learning
Smith (2011) Linguistic Structure Prediction
Manning and Schütze (1999) Foundations of Statistical Natural Language Processing