Cropping issue discussion #40

nitinmukesh · 2024-08-18T17:06:52Z

@nitinmukesh Thank you for your PR. I applied it and it runs successfully, but it does not crop the image. As a result, when the input is a full image (not just a face), AniTalker cannot generate a talking face. Could you please recheck the cropping code? It should automatically crop the image and use the cropped image as the input, correct? It is not working like that.

For the following error:
ERROR: No matching distribution found for pytorch-lightning==1.6.5

downgrade pip to 24.0:
python -m pip install pip==24.0

Also make sure to follow the Windows setup instructions:
https://github.com/X-LANCE/AniTalker/blob/main/md_docs/run_on_windows.md


nitinmukesh · 2024-08-18T18:09:37Z

Test result with auto-crop

Images used

Output

1.mp4

2.mp4

3.mp4

5.mp4

Arvrairobo · 2024-08-19T14:06:57Z

@nitinmukesh Thank you for the help. I figured out why it was not working: I had missed two files, the .dat file and the .npy file. Once I placed them in the data_preprocess folder, it started to work. Thank you very much for your help.
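For anyone hitting the same problem, a quick check along these lines may help confirm that the folder contains files of both types before running inference. This is only a sketch: the helper name `missing_extensions` is hypothetical, the exact file names are not given in this thread (so it checks extensions only), and it assumes the script is run from the repository root.

```python
from pathlib import Path


def missing_extensions(folder, required=(".dat", ".npy")):
    """Return the required extensions that have no matching file in `folder`.

    Hypothetical helper: it only checks file extensions, because the exact
    file names are not stated in this thread.
    """
    present = {p.suffix.lower() for p in Path(folder).iterdir() if p.is_file()}
    return [ext for ext in required if ext not in present]


if __name__ == "__main__":
    # Assumption: data_preprocess sits in the current working directory.
    folder = Path("data_preprocess")
    if not folder.is_dir():
        print(f"{folder} not found; run this from the repository root")
    else:
        missing = missing_extensions(folder)
        if missing:
            print("No files found with these extensions:", ", ".join(missing))
        else:
            print("Found at least one .dat and one .npy file")
```

An empty result only means that some file of each type exists, not that it is the correct one.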

Arvrairobo · 2024-08-19T14:08:24Z

@nitinmukesh @newgenai79
Can we achieve cropping at 512 x 512 resolution instead of 256 x 256?

Also, the blinking and eye movement are not very natural compared to EchoMimic. How can we achieve the same? I tried hubert_pose as well but did not get a perfect result.

What is hubert_full_control?

nitinmukesh · 2024-08-19T17:00:28Z

@nitinmukesh @newgenai79 Can we achieve cropping at 512 x 512 resolution instead of 256 x 256?

I don't think it's possible, since the model is trained at 256 x 256 resolution.
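To illustrate what cropping to a fixed square input involves, here is a minimal sketch of computing a square crop region around a detected face. It is not the cropping code from the PR: the function name `square_crop_box`, the `margin` value, and the bounding-box format are all assumptions, and the face box is expected to come from whatever face detector is in use.

```python
def square_crop_box(face_box, image_size, margin=0.5):
    """Return (left, top, right, bottom) of a square crop centered on a face.

    face_box:   (left, top, right, bottom) from any face detector (assumed format).
    image_size: (width, height) of the source image.
    margin:     extra padding as a fraction of the larger face side (assumed value).
    """
    left, top, right, bottom = face_box
    width, height = image_size

    # Grow the larger face side by the margin, but keep the square inside the image.
    side = int(max(right - left, bottom - top) * (1 + margin))
    side = min(side, width, height)

    # Center the square on the face, then shift it back inside the image bounds.
    cx, cy = (left + right) // 2, (top + bottom) // 2
    x0 = min(max(cx - side // 2, 0), width - side)
    y0 = min(max(cy - side // 2, 0), height - side)
    return x0, y0, x0 + side, y0 + side


if __name__ == "__main__":
    print(square_crop_box((400, 200, 600, 400), (1920, 1080)))
```

With Pillow, the resulting box could be passed to `Image.crop(box)` and the crop then resized to (256, 256) with `Image.resize`, matching the resolution mentioned above. Resizing the crop to 512 x 512 instead would only change the input size, not the resolution the model was trained at.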

Also, the blinking and eye movement are not very natural compared to EchoMimic. How can we achieve the same?

EchoMimic is good because it is trained on a large dataset, hence the better results.

Arvrairobo · 2024-08-20T16:02:51Z

@nitinmukesh EchoMimic is very slow: a 40-second video took about 2.5 hours, so it is not a practical solution. AniTalker is very fast, which is why I am building a solution on top of it.

If the author @liutaocode or any other author can shed some light on this, or give a head start on how to achieve blinking and facial expressions, that would be great. Releasing code or more trained models would also work. Looking forward to it.

Arvrairobo · 2024-08-30T12:36:25Z

Still waiting for the blink feature. Also, is there any update on the project? Any new code release or new features?

nitinmukesh mentioned this issue Aug 19, 2024

very fast inference, so far one of the best #39

Open
