Skip to content

Comments

Experimenting with alternative siglip loss impl for better dist scaling#971

Merged
rwightman merged 2 commits intomainfrom
revisit_siglip_loss
Dec 4, 2024
Merged

Experimenting with alternative siglip loss impl for better dist scaling#971
rwightman merged 2 commits intomainfrom
revisit_siglip_loss

Conversation

@rwightman
Copy link
Collaborator

@JeniaJitsev
Copy link
Contributor

Very nice, giving it a try

@rwightman
Copy link
Collaborator Author

So gather impl here looks like it's a better balance but still scaling issues wrt to throughput. The additions here allow flexiblility to add other impl that can further improve things so will merge what's here for now.

@rwightman rwightman merged commit aeaf2a0 into main Dec 4, 2024
@rwightman rwightman deleted the revisit_siglip_loss branch December 4, 2024 23:55
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants