Skip to content

How to add image data augmentation #7273

@wufan-tb

Description

@wufan-tb

I am seeking assistance regarding an issue I'm encountering.

I intend to incorporate image data augmentation into the SFT process within ms-swift. Currently, this augmentation logic is integrated within the Template._encode function.

However, when training with a substantial dataset, while the mapping and packing stages proceed without incident, the training phase consistently results in a CUDA OOM error. I have confirmed that this error does not manifest when using the same volume of data without augmentation, or when training with a smaller dataset.

Could you please elucidate the potential reasons for this behavior, and what modifications or strategies would you recommend to resolve this issue?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions