Refactor chunked preference functions and distillation base class #491

shivam15s · 2024-12-20T00:51:03Z

Summary

Refactor the chunked preference functions and the distillation base class to remove redundant code.

Testing Done

Hardware Type:
run make test to ensure correctness
run make checkstyle to ensure code style
run make test-convergence to ensure convergence

This reverts commit fac9b78.

hebiao064 · 2024-12-20T05:43:29Z

src/liger_kernel/chunked_loss/cpo_loss.py

-            beta (float): Weight for the CPO loss
+            chosen_logps_chunk (torch.Tensor): Avg log probabilities of chosen tokens in the chunk. Shape: (batch_size,).
+            rejected_logps_chunk (torch.Tensor): Avg log probabilities of rejected tokens in the chunk. Shape: (batch_size,).
+            full_target (torch.Tensor): Non chunked full target tensor.


I wonder whether this is really full_target, or actually target_chunk?

In the fused function, we pass in target_chunk:

def fused_fwd_bwd(
    input_chunk, target_chunk, ref_input_chunk, preference_labels_chunk
):
    """
    Fused forward and backward pass for a chunk of input and target.
    """
    if bias is not None:
        return torch.func.grad_and_value(
            compute_loss, argnums=(0, 1, 3), has_aux=True
        )(
            input_chunk,
            weight,
            target_chunk,
            bias,
            ref_input_chunk=ref_input_chunk,
            preference_labels=preference_labels_chunk,
        )

It feels like it should be the full target, since we use it to normalize and then sum over all chunks, but it seems we're passing target_chunk instead?
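To make the normalization question concrete, here is a minimal sketch of why the denominator matters when per-chunk losses are summed. It is not the Liger Kernel implementation: the helper names (`logsigmoid`, `chunk_loss_sum`, `chunked_mean_loss`), the sigmoid-style preference loss, and the sample values are all assumptions chosen for illustration, and it uses plain Python floats rather than tensors.

```python
import math


def logsigmoid(x):
    # Numerically stable log(sigmoid(x)).
    if x >= 0:
        return -math.log1p(math.exp(-x))
    return x - math.log1p(math.exp(x))


def chunk_loss_sum(chosen_logps, rejected_logps, beta):
    # Hypothetical stand-in for a preference loss: un-normalized sum over one chunk.
    return sum(
        -logsigmoid(beta * (c - r)) for c, r in zip(chosen_logps, rejected_logps)
    )


def chunked_mean_loss(chosen, rejected, chunk_size, beta=0.1, normalize_by_full=True):
    # Accumulate per-chunk losses; the denominator is either the full batch
    # size (what a "full target" would provide) or the current chunk size.
    n_pairs = len(chosen)
    total = 0.0
    for start in range(0, n_pairs, chunk_size):
        c = chosen[start:start + chunk_size]
        r = rejected[start:start + chunk_size]
        denom = n_pairs if normalize_by_full else len(c)
        total += chunk_loss_sum(c, r, beta) / denom
    return total


# Made-up average log-probabilities for four chosen/rejected pairs.
chosen = [-0.5, -1.0, -0.2, -0.8]
rejected = [-1.5, -0.9, -1.1, -2.0]

full = chunk_loss_sum(chosen, rejected, 0.1) / len(chosen)  # un-chunked mean
by_full = chunked_mean_loss(chosen, rejected, chunk_size=2, normalize_by_full=True)
by_chunk = chunked_mean_loss(chosen, rejected, chunk_size=2, normalize_by_full=False)
```

In this sketch, dividing each chunk's sum by the full batch size reproduces the un-chunked mean once the chunks are added up, while dividing by the chunk size and then summing overcounts by the number of chunks. That is the reason a full-batch quantity would be needed inside a per-chunk function if the chunk results are summed afterwards.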

shivam15s added 5 commits December 20, 2024 00:49

refactor

653c181

checkstyle

4950162

num items in batch

fac9b78

Revert "num items in batch"

7a28d1d

This reverts commit fac9b78.

refactor to include chunk for clarity

2487488

shivam15s marked this pull request as draft December 20, 2024 02:49

shivam15s added 5 commits December 20, 2024 03:01

refactor distillation base

5e5c11e

refactor fn order for distillation base

e4d8233

Merge branch 'main' into shisahni/preference_refactor

aa44a9f

checkstyle

e55e9bc

checkstyle

764e465

shivam15s marked this pull request as ready for review December 20, 2024 03:11

shivam15s changed the title ~~Refactor accumulate logic~~ Refactor chunked preference functions and distillation base class Dec 20, 2024

refactor more code

f40c6c8

hebiao064 reviewed Dec 20, 2024

hebiao064 mentioned this pull request Dec 21, 2024

Add KTO Loss #475

Merged

3 tasks


qingquansong Mar 3, 2025
