Cropping issue discussion #40

Open
nitinmukesh opened this issue Aug 18, 2024 · 6 comments

Comments

@nitinmukesh
Collaborator

nitinmukesh commented Aug 18, 2024

Arvrairobo commented 3 days ago

@nitinmukesh Thank you for your PR. I applied it and it runs successfully, but it is not cropping the image. As a result, when the input is a full image (not just a face), AniTalker cannot generate a talking face. Could you please recheck the cropping code? It should automatically crop the image and use the cropped image as input, correct? It is not working like that for me.

For the following error:
ERROR: No matching distribution found for pytorch-lightning==1.6.5

pin pip to version 24.0:
python -m pip install pip==24.0

Also make sure to follow the Windows installation guide:
https://github.com/X-LANCE/AniTalker/blob/main/md_docs/run_on_windows.md

@nitinmukesh
Collaborator Author

Test result with auto-crop

Images used: 1, 2, 3, 5

Output: 1.mp4, 2.mp4, 3.mp4, 5.mp4

@Arvrairobo

@nitinmukesh Thank you for the help. I figured out why it was not working: I had missed two files, the .dat file and the .npy file. Once I placed them in the data_preprocess folder, it started to work. Thank you very much for your help.
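
A quick pre-flight check along these lines can catch the same mistake. This is only a sketch: the thread does not give the exact file names, so the hypothetical helper `missing_preprocess_extensions` just checks that the `data_preprocess` folder holds at least one `.dat` file and one `.npy` file.

```python
from pathlib import Path


def missing_preprocess_extensions(folder, required=(".dat", ".npy")):
    """Return the required extensions that have no matching file in `folder`.

    Hypothetical helper: the exact file names are not given in this thread,
    so only the extensions are checked.
    """
    folder = Path(folder)
    if not folder.is_dir():
        # A missing folder means every required file is missing.
        return list(required)
    present = {p.suffix.lower() for p in folder.iterdir() if p.is_file()}
    return [ext for ext in required if ext not in present]


if __name__ == "__main__":
    missing = missing_preprocess_extensions("data_preprocess")
    if missing:
        print("Missing files with extensions:", ", ".join(missing))
    else:
        print("Found a .dat and a .npy file in data_preprocess")
```

Run it from the repository root before starting inference; an empty result means both kinds of file are present, not that they are the correct ones.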

@Arvrairobo

Arvrairobo commented Aug 19, 2024

@nitinmukesh @newgenai79
Can we crop at 512 x 512 resolution instead of 256 x 256?

Also, the blinking and eye movement are not very natural compared to EchoMimic. How can we achieve the same quality? I tried hubert_pose as well but did not get a good result.

What is hubert_full_control?

@nitinmukesh
Collaborator Author

nitinmukesh commented Aug 19, 2024

@nitinmukesh @newgenai79 Can we crop at 512 x 512 resolution instead of 256 x 256?

I don't think it's possible since the model is trained on 256 x 256 resolution.
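
Since the model works at 256 x 256, a higher-resolution source still has to be reduced to a square face crop before it is resized to that size. The sketch below shows only the box arithmetic, assuming a face bounding box is already available from some detector. `square_crop_box` and its `margin` parameter are hypothetical and are not AniTalker's actual cropping code.

```python
def square_crop_box(img_w, img_h, face_box, margin=0.5):
    """Return a square (left, top, right, bottom) crop around a face box.

    face_box is (left, top, right, bottom) in pixels. The square is centered
    on the face, enlarged by `margin` relative to the longer face side,
    shrunk if needed to fit the image, then shifted to stay inside it.
    Hypothetical helper, for illustration only.
    """
    left, top, right, bottom = face_box
    cx = (left + right) / 2
    cy = (top + bottom) / 2
    side = max(right - left, bottom - top) * (1 + margin)
    side = int(min(side, img_w, img_h))  # the square cannot exceed the image
    x0 = int(round(cx - side / 2))
    y0 = int(round(cy - side / 2))
    # Shift the square back inside the image bounds if it spills over an edge.
    x0 = max(0, min(x0, img_w - side))
    y0 = max(0, min(y0, img_h - side))
    return (x0, y0, x0 + side, y0 + side)


if __name__ == "__main__":
    print(square_crop_box(1000, 800, (400, 300, 600, 500)))
```

The resulting box could then be passed to an image library's crop call (for example Pillow's `Image.crop`) and the crop resized to 256 x 256.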

Also, the blinking and eye movement are not very natural compared to EchoMimic. How can we achieve the same quality?

EchoMimic is trained on a large dataset, hence its results are better.

@Arvrairobo

@nitinmukesh EchoMimic is very slow: a 40-second video took about 2.5 hours, so it is not a practical solution. AniTalker is very fast, which is why I am building a solution on top of it.

If the author @liutaocode or any other author could shed some light on this, or give a head start on how to achieve blinking and facial expressions, that would be great. Releasing the code or more trained models would also work. Looking forward to it.

@Arvrairobo

Still waiting for the blink feature. Is there any update on the project, such as a new code release or new features?
