Two new datasets were created. A prefix-tuning baseline was compared against ADIFF, which uses a cross-projection module and position captioning. ADIFF showed significant improvements in both objective and human evaluation.