
any inference code or something to check the model #8

Open

Occupying-Mars opened this issue Nov 16, 2023 · 6 comments
@Occupying-Mars

No description provided.

@truebit

truebit commented Nov 23, 2023

After some investigation, I replicated the inference code, using the same goal together with one or more supplied screenshots and the history of previous actions.
It does not work very well in zero-shot situations.
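
For context, the per-step input assembly looks roughly like the sketch below. This is a simplified illustration, not my exact code: the "Goal: ... Previous Actions: ..." prompt layout and the `build_step_input` helper are illustrative assumptions, with the image features coming from a BLIP-2 encoder as elsewhere in this thread.

```python
# Simplified sketch of assembling one inference step from a goal, the
# current screenshot's features, and the history of previous actions.
# The "Goal: ... Previous Actions: ..." layout is illustrative only.

def build_step_input(goal, action_history, image_features):
    # Flatten the previous actions into a single prompt string.
    history_text = " ".join(
        f"{i}: {act}" for i, act in enumerate(action_history)
    )
    prompt = f"Goal: {goal} Previous Actions: {history_text}"
    return {
        "text": prompt,           # language-side input
        "image": image_features,  # vision-side input (e.g. pooled BLIP-2 features)
    }

# Zero-shot case: no prior actions in the history.
step = build_step_input("How to login?", [], image_features=None)
print(step["text"])
```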

@YiDa858

YiDa858 commented Dec 25, 2023

@truebit Can you publish your inference code? I would appreciate it!

@kirtishrinkhala

@truebit Please share the inference code if possible.

@kirtishrinkhala

kirtishrinkhala commented Jan 9, 2024

I have been working on writing the inference code; here is what I have so far. I wrote a function that produces the processed input for an image and the goal. However, I am now not sure how to use that as input to a pretrained model.

This is the code that I wrote to process the image file and the goal:

```python
import argparse
import json
import pickle

import torch
from PIL import Image
from transformers import AutoProcessor, Blip2Model

# Load BLIP-2 once at import time so repeated calls reuse the weights.
# Use fp16 on GPU; fall back to fp32 on CPU, where fp16 ops are poorly supported.
device = "cuda" if torch.cuda.is_available() else "cpu"
dtype = torch.float16 if device == "cuda" else torch.float32
model = Blip2Model.from_pretrained("Salesforce/blip2-opt-2.7b", torch_dtype=dtype)
model.to(device)
processor = AutoProcessor.from_pretrained("Salesforce/blip2-opt-2.7b")


def parse_image(image_file_path):
    # Goal and step id are hardcoded for this single-image test.
    output_ep = {
        "goal": "How to login?",
        "step_id": "123",
    }

    img = Image.open(image_file_path)

    # Extract pooled BLIP-2 image features for the screenshot.
    with torch.no_grad():
        inputs = processor(images=img, return_tensors="pt").to(device, dtype)
        image_features = model.get_image_features(**inputs).pooler_output[0]
    output_ep["image"] = image_features.detach().cpu()

    # Wrap the single step in the episode structure used by the dataset.
    parsed_episode = [{"episode_id": 123, "data": [output_ep]}]
    return parsed_episode


def parse_args():
    parser = argparse.ArgumentParser()
    parser.add_argument('--output_dir', type=str, default='dataset')
    parser.add_argument('--file_path', type=str, default='sample.png')
    return parser.parse_args()


if __name__ == '__main__':
    args = parse_args()
    print('====Input Arguments====')
    print(json.dumps(vars(args), indent=2, sort_keys=False))

    all_parsed_episode = parse_image(args.file_path)

    with open(f"{args.output_dir}_test_val.obj", "wb") as wp:
        pickle.dump(all_parsed_episode, wp)
```
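
For the consuming side, my best guess is something along the lines below, though I have not been able to verify it. Everything model-specific here is an assumption: it presumes the checkpoint is a T5-style seq2seq model from the mm-cot lineage, so the `T5ForMultimodalGeneration` class, its `image_ids` keyword, and the `cooelf/Auto-UI-base` path are placeholders that may not match the released code.

```python
# Hypothetical sketch: feed the pickled features to an Auto-UI-style checkpoint.
# ASSUMPTIONS: the repo's model.py exposes T5ForMultimodalGeneration (mm-cot
# lineage) whose generate() forwards an `image_ids` tensor of image features,
# and "cooelf/Auto-UI-base" stands in for the real checkpoint path.
import pickle

import torch
from transformers import AutoTokenizer

from model import T5ForMultimodalGeneration  # assumed module from the repo

tokenizer = AutoTokenizer.from_pretrained("cooelf/Auto-UI-base")
# Constructor arguments (e.g. feature dimensions) may differ in the real class.
model = T5ForMultimodalGeneration.from_pretrained("cooelf/Auto-UI-base")
model.eval()

# Load the episode produced by the preprocessing script above.
with open("dataset_test_val.obj", "rb") as rp:
    episodes = pickle.load(rp)
step = episodes[0]["data"][0]

inputs = tokenizer(f"Goal: {step['goal']}", return_tensors="pt")  # assumed prompt layout
with torch.no_grad():
    output_ids = model.generate(
        input_ids=inputs.input_ids,
        attention_mask=inputs.attention_mask,
        image_ids=step["image"].unsqueeze(0),  # assumed kwarg for the image features
        max_new_tokens=64,
    )
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```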

@Jiayi-Pan

Hi friends,

We’ve got AutoUI running and tested its end-to-end performance in our recent paper. You can find the inference code here:

https://github.com/Berkeley-NLP/Agent-Eval-Refine/tree/main/exps/android_exp/models/Auto-UI

@Yingrjimsch

> Hi friends,
>
> We’ve got AutoUI running and tested its end-to-end performance in our recent paper. You can find the inference code here:
>
> https://github.com/Berkeley-NLP/Agent-Eval-Refine/tree/main/exps/android_exp/models/Auto-UI

Great job, thanks, will try that 👍 Any insights into how well it works for zero-shot approaches?
