I think You used A+I not just I.

Source: Deep Learning on Medium

Stack the GCN layers. We here use just the identity matrix as feature representation, that is, each node is represented as a one-hot encoded categorical variable.