From: DLM-DTI: a dual language model for the prediction of drug-target interaction with hint-based learning
Hyperparameter               Teacher    Student
Number of hidden layers      30         2
Number of attention heads    16 (both models)
Hidden dimension             1024 (both models)
Intermediate size            4096 (both models)
Number of parameters         420 M      26 M