Autoregressive And Masked Language Modeling