# metafusion library

## Pipes classes

Pipe classes implement different ways to generate or process images using diffusion models.

**Prompt2ImPipe** is a pipe that generates an image from a text prompt.

```python
# the pipe classes are assumed to be importable from the multigen package
from multigen.pipes import Prompt2ImPipe

pipe = Prompt2ImPipe(model_id='runwayml/stable-diffusion-v1-5')
pipe.setup(width=768, height=768, clip_skip=2)
image = pipe.gen({'prompt': 'bio-tech lab with computers and exotic flowers, artwork'})
image.save('bio.png')
```

**Im2ImPipe** is a pipe that generates an image from another image.

```python
from multigen.pipes import Im2ImPipe

pipe = Im2ImPipe(model_id='runwayml/stable-diffusion-v1-5')
pipe.setup("./_projects/biolab/00000.png", strength=0.5, steps=25)
img = pipe.gen({'prompt': 'biolab, anime style drawing, green colors'})
img.save('bio1.png')
```

**Cond2ImPipe** is a pipe that generates an image from another image plus a conditioning image, e.g. an image after Canny edge detection. The conditioning image is processed internally with ControlNet, using StableDiffusion(XL)ControlNetPipeline.

ControlNet models are expected to be in ./models-cn/ for StableDiffusion and in ./models-cn-xl/ for StableDiffusionXL.
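
This README gives no example for Cond2ImPipe, so the sketch below is hypothetical: it assumes the same constructor/setup()/gen() pattern as the CIm2ImPipe example further down, with the pre-computed conditioning image passed to setup(). The file name and argument placement are assumptions, not verified against the library:

```python
# Hypothetical sketch, by analogy with the CIm2ImPipe example below.
# Here the input is the conditioning image itself (e.g. a pre-computed
# Canny edge map); argument names are assumptions.
from multigen.pipes import Cond2ImPipe, ControlnetType

pipe = Cond2ImPipe('runwayml/stable-diffusion-v1-5', model_type=ControlnetType.SD)
pipe.setup("./bio1_canny.png", strength=0.5, steps=25, ctypes=['canny'])
img = pipe.gen({'prompt': 'biolab, anime style drawing, green colors'})
img.save('bio_cond.png')
```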

**CIm2ImPipe** is similar to Cond2ImPipe. The difference is that the conditioning image is not taken as input but is computed from the input image; the image processor used depends on the conditioning type.

```python
# ControlnetType is assumed to be exported alongside the pipe classes
from multigen.pipes import CIm2ImPipe, ControlnetType

model_id = 'runwayml/stable-diffusion-v1-5'
pipe = CIm2ImPipe(model_id, model_type=ControlnetType.SD)
pipe.setup("./bio1.png", strength=0.5, steps=25, ctypes=['soft'])
img = pipe.gen({'prompt': 'biolab, anime style drawing, green colors'})
img.save('bio2.png')
```

Possible values for ctypes:
* 'canny' - Canny edge detection
* 'soft-sobel' - Sobel edge detection
* 'soft' - same as 'soft-sobel' but with a different edge detector
* 'depth' - depth map
* 'pose' - an OpenPose bone image
* 'ip2p' - the original image is used as control
* 'inpaint' - the original image is thresholded and inpainted (use InpaintingPipe for this option)
* 'qr' - the original image is used as control
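
Since ctypes takes a list, several conditioning types can presumably be combined; the combination itself is an assumption, while the rest mirrors the CIm2ImPipe example above:

```python
# Assumption: multiple conditioning types can be combined by listing them
# in ctypes; each requires the corresponding ControlNet model in ./models-cn/.
pipe = CIm2ImPipe('runwayml/stable-diffusion-v1-5', model_type=ControlnetType.SD)
pipe.setup("./bio1.png", strength=0.5, steps=25, ctypes=['canny', 'depth'])
img = pipe.gen({'prompt': 'biolab, anime style drawing, green colors'})
img.save('bio3.png')
```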

**MaskedIm2ImPipe** is an image-to-image pipeline that uses a mask to redraw only certain parts of the input image. It can be used as an inpainting pipeline with any non-inpainting model. The pipeline computes the mask from the difference between the original image and the image with the mask painted over it. The color of the mask affects the result.

```python
import numpy
import PIL.Image

from multigen.pipes import MaskedIm2ImPipe

blur = 8
model_dir = './models-sd/'  # assumed base directory for model files
model_id = 'runwayml/stable-diffusion-v1-5'
img = PIL.Image.open("./mech_beard_sigm.png")
# read the same image with the mask painted over it
img_paint = numpy.array(PIL.Image.open("./mech_beard_sigm_mask.png"))
scheduler = "EulerAncestralDiscreteScheduler"  # gives good results
pipe = MaskedIm2ImPipe(model_dir + model_id)
pipe.setup(original_image=img, image_painted=img_paint, strength=0.85,
           scheduler=scheduler, guidance_scale=7, clip_skip=3, blur=blur)
prompt = "a man wearing a mask"
img = pipe.gen({'prompt': prompt, 'seed': 84958344})
img.save('inpaint1.png')
```

## metafusion service

**ServiceThread** from the multigen.worker module implements a generation queue that can be used to build e.g. a web service.

This class needs two config files. One (config.yaml in our example) specifies the directories to be used by ServiceThread:

*config.yaml*
```yaml
model_list: models.yaml
model_dir: ./models-sd/
logging_folder: ./logs/
```
Another file, *models.yaml*, specifies models and pipelines:

```yaml
base:
  sdxl1:
    # https://huggingface.co/stabilityai/stable-diffusion-xl-base-1.0/blob/main/sd_xl_base_1.0.safetensors
    id: SDXL/stable-diffusion-xl-base-1.0/sd_xl_base_1.0.safetensors
    xl: True
lora:
  # https://huggingface.co/ByteDance/Hyper-SD/blob/main/Hyper-SDXL-4steps-lora.safetensors
  hypersd4steps:
    id: Hyper-SDXL-4steps-lora.safetensors
    xl: True
pipes:
  prompt2image:
    name: "Prompt to Image"
    classname: Prompt2ImPipe
    need_imgs: False
    needs_cnet: False
```

Before scheduling image generation, one needs to create a session. The session determines where generation results are stored, both inside the ServiceThread and on the filesystem. **open_session** returns a dictionary containing a "session_id", which can be used later to query generation results.

These arguments are expected by **open_session** (see the full example below):
* *user* - user name
* *project* - project name; determines where generated results are stored
* *pipe* - the name of the pipe to use for the session, as specified in the models config file under the "pipes" field
* *model* - the name of the model to use for the session, as specified in the models config file under the "base" field
* *loras* - a list of lora models to load; each must be present in the models config under the "lora" field

**queue_gen** is used to schedule the generation of images. These keyword arguments are expected:
* *session_id* - one of the session ids returned by open_session
* *count* - the number of images to generate
* *image_callback* - a callback function that accepts one argument: the path to a generated image

Other arguments are passed as-is to the setup() method of the pipeline, and *prompt* is passed to gen().

Full example:
```python
from multigen.worker import ServiceThread

cfg_file = 'config.yaml'
worker = ServiceThread(cfg_file)
worker.start()

pipe = "prompt2image"
model = 'sdxl1'
result = worker.open_session(
    user='test',
    project="results",
    model=model,
    pipe=pipe,
    loras=['hypersd4steps'],
)

# prompt and seed were left undefined in the original snippet; example values
prompt = 'bio-tech lab with computers and exotic flowers, artwork'
seed = 84958344

count = 5
c = 0

def on_new_image(*args, **kwargs):
    print(args, kwargs)
    global c
    c += 1

if 'error' not in result:
    sess_id = result['session_id']
    worker.queue_gen(session_id=sess_id,
                     images=None,
                     prompt=prompt,
                     image_callback=on_new_image,
                     lpw=True,
                     width=1024, height=1024, steps=4,
                     timestep_spacing='trailing',
                     guidance_scale=0,
                     scheduler='EulerAncestralDiscreteScheduler',
                     count=count,
                     seeds=[seed + i for i in range(count)],
                     )
```