따라서 인코딩보다는 디코딩에 더 맞음

Source: Deep Learning on Medium

…has 100% correlation with the target). In this way, the model would not be learning anything useful neither about the grammar or syntax of the language nor the meanings of the tokens and would not be useful for downstream tasks.