To address the performance degradation of cross-domain object detection under various illumination conditions and adverse weather scenarios, this paper introduces a novel method a called Step-wise Domain Adaptation DEtection TRansformer (SDA-DETR).Our approach decomposes the adaptation process into Hunting Trousers three sequential steps, progressively transferring knowledge from a labeled dataset to an unlabeled one using the DETR (DEtection TRansformer) architecture.Each step precisely reduces domain discrepancy, thereby facilitating effective transfer learning.In the initial step, a target-like domain HERBATINT FF2 is constructed as an auxiliary to the source domain to reduce the domain gap at the image level.
Then, we adaptively align the source domain and target domain features at both global and local levels.To further mitigate model bias towards the source domain, we develop a token-masked autoencoder (t-MAE) to enhance target domain features at the semantic level.Comprehensive experiments demonstrate that the SDA-DETR outperforms several popular cross-domain object detection methods on three challenging public driving datasets.