TensorRT using Nvidia GPU #3016

yeahme49 · 2022-03-25T23:36:10Z

yeahme49
Mar 25, 2022

I was able to make an Nvidia GPU work for detection by modifying this https://github.com/yury-sannikov/frigate/tree/gstreamer (#2548)

I started by using the watsor dockerfile (https://github.com/asmirnou/watsor/blob/master/docker/Dockerfile.base) but updating it for ubuntu 20.04: https://pastebin.com/ERhsFwEi
and then the watsor gpu dockerfile (https://github.com/asmirnou/watsor/blob/master/docker/Dockerfile.gpu.base) using my new local image: https://pastebin.com/HnBpHWJX

Then I modified the yolo4 converter files for amd64 instead of arm.
Dockerfile: https://pastebin.com/iHbGPR9Z
build.sh: https://pastebin.com/SnA9gibi
assets/install_protobuf.sh: https://pastebin.com/xHfdG3V8
assets/run.sh: https://pastebin.com/XPC0qmCe

Once the models completed, I copied the .trt files from yolo4/model and the .so from yolo4/plugin to a tensor_assets folder i made in my build folder. Then I updated the Dockerfile for amd64 instead of arm: https://pastebin.com/Ey6vFyRh

Once I had that image I was able to have my docker-compose file use that image instead of the standard frigate image. Added
model: path: /yolo4/yolov4-tiny-416.trt labelmap_path: /labelmap.txt width: 416 height: 416 to my config.yml file and started it up.

Using a 2GB Quadro P400 card my inference speed is around 16-17ms (9-10ms if using yolo4-tiny-288.trt). I run frigate on a Proxmox LXC container with 3 CPU cores assigned and 2GB of RAM, using nvidia hardware decoder. Previously my inference times were between 80-100ms and 75-80% CPU usage on that container.

I'm sure the docker files can be cleaned up and the process streamlined, there is probably some overlap between what gets built and installed between the different docker images since I was combining multiple other docker files.

If anyone is able to test this out and see if it works for them as well that would be awesome. Let me know if anything I posted doesn't work incase I forgot to add any changes i had to make. If others can get it working hopefully it can get cleaned up and added at some point.

Jeck-Liu-Create · 2022-04-16T02:44:42Z

Jeck-Liu-Create
Apr 16, 2022

Hi guys, I'm having some problems，
An error message was encountered while building docker ：

#26 [web 1/4] FROM docker.io/library/node:16@sha256:ffe804d6fcced29bcfc3477de079d03a9c2b0e4917e44bfeafb1a6b0f875e383
#26 sha256:bc31ad4563208bbf10475628f9d494c225b44a4f8f94780595a5cf41ee2eb2a0
#26 CANCELED
------
 > [wheels 2/5] RUN apt-get -qq update     && apt-get -qq install -y     apt-transport-https     gnupg     wget     && wget -O - http://archive.raspberrypi.org/debian/raspberrypi.gpg.key | apt-key add -     && echo "deb http://archive.raspberrypi.org/debian/ bullseye main" | tee /etc/apt/sources.list.d/raspi.list     && apt-get -qq update     && apt-get -qq install -y python3     python3-dev     wget     build-essential cmake git pkg-config libgtk-3-dev     libavcodec-dev libavformat-dev libswscale-dev libv4l-dev     libxvidcore-dev libx264-dev libjpeg-dev libpng-dev libtiff-dev     gfortran openexr libatlas-base-dev libssl-dev    libtbb2 libtbb-dev libdc1394-22-dev libopenexr-dev     libgstreamer-plugins-base1.0-dev libgstreamer1.0-dev     gcc gfortran libopenblas-dev liblapack-dev:
------
executor failed running [/bin/sh -c apt-get -qq update     && apt-get -qq install -y     apt-transport-https     gnupg     wget     && wget -O - http://archive.raspberrypi.org/debian/raspberrypi.gpg.key | apt-key add -     && echo "deb http://archive.raspberrypi.org/debian/ bullseye main" | tee /etc/apt/sources.list.d/raspi.list     && apt-get -qq update     && apt-get -qq install -y python3     python3-dev     wget     build-essential cmake git pkg-config libgtk-3-dev     libavcodec-dev libavformat-dev libswscale-dev libv4l-dev     libxvidcore-dev libx264-dev libjpeg-dev libpng-dev libtiff-dev     gfortran openexr libatlas-base-dev libssl-dev    libtbb2 libtbb-dev libdc1394-22-dev libopenexr-dev     libgstreamer-plugins-base1.0-dev libgstreamer1.0-dev     gcc gfortran libopenblas-dev liblapack-dev]: exit code: 100
make: *** [Makefile:34: frigate] Error 1

1 reply

yeahme49 Apr 21, 2022
Author

Did you use my modified files? That error looks like it's using raspberry pi files, which my modded files shouldn't be using.

gavink2 · 2022-05-04T10:21:47Z

gavink2
May 4, 2022

I would love to try this as I too have a Quadro P400 that I'm just using for hardware decode at the moment, and I can't get hold of a Coral. However, I'm more of a docker user than a docker builder so I'm finding your instructions are a bit too vague for me to figure out what I need to do. Would be really great to see all the commands you used here, especially for the docker builds.

0 replies

nordeep · 2022-05-10T12:34:27Z

nordeep
May 10, 2022

@yeahme49 great job! I'm successfully run Frigate with my GeForce GTX 750 Ti 2GB by your instruction. For a future build, we need to describe it in more details.

0 replies

yeahme49 · 2022-06-23T20:15:27Z

yeahme49
Jun 23, 2022
Author

I've uploaded my latest image which should have all the latest 0.11.0 changes in it to docker hub.
https://hub.docker.com/r/yeahme49/frigatetensor

Should be able to use yeahme49/frigatetensor:latest to try this out without having to build the image yourself. Let me know how it works.

5 replies

atlury Jun 24, 2022

Thank you! I will try this out and report the feedback on i3-8100, 8GB Ram and Nvidia GT 1030

atlury Jun 24, 2022

Can you please share any sample config for 1 or 2 camera setup?

Edit: Never mind found it below!

kabadisha Dec 26, 2022

@yeahme49 Awesome!
Thanks for the great work. Got it working using your docker image but with one small niggle: I had to manually create the directory /yolo4 before the container would start.

Actually, I'm running on an unRaid host I had to quickly use docker exec <container_name> mkdir -p /yolo4 while I started the container from the unRaid UI.

Did I miss something? Or maybe you can update your docker image to create the dir?
Thanks :-)

NickM-27 Dec 26, 2022
Collaborator Sponsor

I believe you need to map that directory on the host side to the container, the container shouldn't be the one creating that.

kabadisha Dec 26, 2022

I believe you need to map that directory on the host side to the container, the container shouldn't be the one creating that.

Good thinking! So obvious. Thanks for the suggestion - worked a treat.

yeahme49 · 2022-06-23T22:44:06Z

yeahme49
Jun 23, 2022
Author

I didn't test any gstreamer stuff since I thought that was targeted to the Nvidia Jetson stuff and this is for PC so it might be missing. I'll look into that

…

On Thursday, June 23, 2022, urbydoo ***@***.***> wrote: This camera config: northwest: gstreamer: video_format: video/x-h265 audio_format: audio/x-alaw inputs: - path:xxxxx is causing: ------------------------------ *** Config Validation Errors *** ------------------------------ argument of type 'NoneType' is not iterable [2022-06-23 15:16:19] frigate.gstreamer ERROR : gst-inspect-1.0 failed with the message: Traceback (most recent call last): File "/opt/frigate/frigate/gstreamer.py", line 73, in gst_inspect_find_codec data = sp.check_output( File "/usr/lib/python3.9/subprocess.py", line 424, in check_output return run(*popenargs, stdout=PIPE, timeout=timeout, check=True, File "/usr/lib/python3.9/subprocess.py", line 505, in run with Popen(*popenargs, **kwargs) as process: File "/usr/lib/python3.9/subprocess.py", line 951, in *init* self._execute_child(args, executable, preexec_fn, close_fds, File "/usr/lib/python3.9/subprocess.py", line 1823, in _execute_child raise child_exception_type(errno_num, err_msg, err_filename) FileNotFoundError: [Errno 2] No such file or directory: 'gst-inspect-1.0' Traceback (most recent call last): File "/opt/frigate/frigate/app.py", line 313, in start self.init_config() File "/opt/frigate/frigate/app.py", line 83, in init_config self.config = user_config.runtime_config File "/opt/frigate/frigate/config.py", line 1088, in runtime_config camera_config.create_decoder_cmds() File "/opt/frigate/frigate/config.py", line 691, in create_decoder_cmds gst_cmd = self._get_gstreamer_cmd(self.gstreamer, input) File "/opt/frigate/frigate/config.py", line 759, in _get_gstreamer_cmd get_gstreamer_builder(self.detect.width, self.detect.height, self.name) File "/opt/frigate/frigate/gstreamer.py", line 356, in get_gstreamer_builder if builder.accept(available_plugins): File "/opt/frigate/frigate/gstreamer.py", line 336, in accept if plugin not in plugins: TypeError: argument of type 'NoneType' is not iterable ------------------------------ *** End Config Validation Errors *** ------------------------------ Did i botch my config or is g-inspect missing from the container? — Reply to this email directly, view it on GitHub <#3016 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AN6Z4QD4E6GEQA64DE2SSPTVQTQ5RANCNFSM5RVRZWOQ> . You are receiving this because you were mentioned.Message ID: ***@***.*** com>

2 replies

urbydoo Jun 23, 2022

my fault, i'll flip back to fmpeg, i was going down that path given you started with that fork.

Is this the right detector setup?
detectors:
JetsonNano:
type: tensorrt

yeahme49 Jun 23, 2022
Author

This is what I have

detectors:
  cuda:
    type: tensorrt

and

model:
  path: /yolo4/yolov4-tiny-416.trt
  labelmap_path: /labelmap.txt
  width: 416
  height: 416

This image probably won't work with a jetson even with ffmpeg since it was built for amd64 not arm, but I could be wrong.

gavink2 · 2022-06-24T09:47:52Z

gavink2
Jun 24, 2022

It works! Thank you so much @yeahme49! I'm using a Xeon D-2146NT, 64GB RAM and a Quadro P400. For reference, here's my minimal setup:

docker-compose.yml:

version: "3.8"
services:
  frigatetensor:
    container_name: frigatetensor
    privileged: true # this may not be necessary for all setups
    restart: unless-stopped
    image: yeahme49/frigatetensor:latest
    runtime: nvidia
    environment:
      - NVIDIA_VISIBLE_DEVICES=all
      - NVIDIA_DRIVER_CAPABILITIES=compute,utility,video
      - FRIGATE_RTSP_PASSWORD=password
    shm_size: "64mb" # update for your cameras based on calculation above
    volumes:
      - /etc/localtime:/etc/localtime:ro
      - ./config.yml:/config/config.yml:ro
      - /var/lib/frigate:/media/frigate
      - type: tmpfs # Optional: 1GB of memory, reduces SSD/SD Card wear
        target: /tmp/cache
        tmpfs:
          size: 1000000000
    ports:
      - "5000:5000"
      - "1935:1935" # RTMP feeds

config.yml:

mqtt:
  host: <host-ip>
  user: frigate
  password: <password>
cameras:
  gate:
    ffmpeg:
      input_args:
        - -c:v
        - h264_cuvid
      inputs:
        - path: rtsp://<camera1-url>
          roles:
            - record
            - detect
  front_door:
    ffmpeg:
      input_args:
        - -c:v
        - h264_cuvid
        - -rtsp_transport
        - tcp
      inputs:
        - path: rtsp://<camera2-url>
          roles:
            - record
            - detect
record:
  enabled: True
detectors:
  cuda:
    type: tensorrt
model:
  path: /yolo4/yolov4-tiny-416.trt
  labelmap_path: /labelmap.txt
  width: 416
  height: 416

which gives me the following output from nvidia-smi:

Fri Jun 24 09:26:10 2022
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 510.73.08    Driver Version: 510.73.08    CUDA Version: 11.6     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|===============================+======================+======================|
|   0  Quadro P400         Off  | 00000000:65:00.0 Off |                  N/A |
| 34%   46C    P0    N/A /  N/A |    545MiB /  2048MiB |      1%      Default |
|                               |                      |                  N/A |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| Processes:                                                                  |
|  GPU   GI   CI        PID   Type   Process name                  GPU Memory |
|        ID   ID                                                   Usage      |
|=============================================================================|
|    0   N/A  N/A    301084      C   python3                           269MiB |
|    0   N/A  N/A    301100      C   ffmpeg                            137MiB |
|    0   N/A  N/A    301106      C   ffmpeg                            137MiB |
+-----------------------------------------------------------------------------+

Frigate debug stats:

and I have a screenshot of me surrounded by a bounding box labelled 'person', but I'm not gonna post that...

3 replies

DarrenRiedlinger Jul 12, 2022

FYI: In your docker-compose.yaml, you have two blocks defining environment variables . I didn't notice the second environment definition at the bottom and was getting some weird behavior where newly added environment variables the first block (i.e. setting YOLO_MODELS=XXX) weren't getting set in the container (even though the original NVIDIA environment variables from your docker-compose were). Moving all the environment variables to a single block fixed it. Just pointing it out to potentially save someone else some head scratching.

gavink2 Jul 12, 2022

Thanks for pointing this out. I actually noticed a week ago when I was trying to add my Frigate+ API key and got the same behaviour as you. But then I forgot to update my post here. I suspect the NVIDIA_* variables are redundant as something, possibly runtime: nvidia, is setting their default values anyway. Have updated my comment regardless.

NickM-27 Jul 12, 2022
Collaborator Sponsor

I suspect the NVIDIA_* variables are redundant as something, possibly runtime: nvidia, is setting their default values anyway

Here are the docs for the 0.11 beta release which don't include that anymore

arkadiy-telegin · 2022-06-24T14:13:59Z

arkadiy-telegin
Jun 24, 2022

Getting some errors the pre-built image (yeahme49/frigatetensor:latest):

[06/24/2022-16:08:56] [TRT] [I] [MemUsageChange] Init CUDA: CPU +187, GPU +0, now: CPU 234, GPU 2650 (MiB)
[06/24/2022-16:08:56] [TRT] [I] Loaded engine size: 40 MiB
[06/24/2022-16:08:56] [TRT] [E] 6: The engine plan file is generated on an incompatible device, expecting compute 5.0 got compute 6.1, please rebuild.
[06/24/2022-16:08:56] [TRT] [E] 4: [runtime.cpp::deserializeCudaEngine::50] Error Code 4: Internal Error (Engine deserialization failed.)

Seems like I have a way too fresh version of CUDA?
I'm running version 1.9.0-1 of nvidia-container-toolkit.

4 replies

gavink2 Jun 24, 2022

What GPU do you have and what version of compute does it support? Might be that it's older than the Quadro P400 that @yeahme49 presumably built for. I have v1.10.0-1 of nvidia-container-toolkit, which works fine, so it's probably not that yours is too new. For reference, here are my nvidia packages on Ubuntu Server 20.04:

$ apt list --installed | grep nvidia
libnvidia-cfg1-510-server/focal-updates,focal-security,now 510.73.08-0ubuntu0.20.04.1 amd64 [installed,automatic]
libnvidia-compute-510-server/focal-updates,focal-security,now 510.73.08-0ubuntu0.20.04.1 amd64 [installed,automatic]
libnvidia-container-tools/bionic,now 1.10.0-1 amd64 [installed,automatic]
libnvidia-container1/bionic,now 1.10.0-1 amd64 [installed,automatic]
libnvidia-decode-510-server/focal-updates,focal-security,now 510.73.08-0ubuntu0.20.04.1 amd64 [installed]
nvidia-compute-utils-510-server/focal-updates,focal-security,now 510.73.08-0ubuntu0.20.04.1 amd64 [installed,automatic]
nvidia-container-toolkit/bionic,now 1.10.0-1 amd64 [installed,automatic]
nvidia-dkms-510-server/focal-updates,focal-security,now 510.73.08-0ubuntu0.20.04.1 amd64 [installed,automatic]
nvidia-docker2/bionic,now 2.11.0-1 all [installed]
nvidia-headless-510-server/focal-updates,focal-security,now 510.73.08-0ubuntu0.20.04.1 amd64 [installed]
nvidia-headless-no-dkms-510-server/focal-updates,focal-security,now 510.73.08-0ubuntu0.20.04.1 amd64 [installed,automatic]
nvidia-kernel-common-510-server/focal-updates,focal-security,now 510.73.08-0ubuntu0.20.04.1 amd64 [installed,automatic]
nvidia-kernel-source-510-server/focal-updates,focal-security,now 510.73.08-0ubuntu0.20.04.1 amd64 [installed,automatic]
nvidia-utils-510-server/focal-updates,focal-security,now 510.73.08-0ubuntu0.20.04.1 amd64 [installed]

atlury Jun 24, 2022

@gavink2

Is it possible to just post a layman step by step installation instructions? It will help newbies like us.

yeahme49 Jun 26, 2022
Author

Can you pull the docker image again? I rebuilt it for compute 3.5 and up now instead of just 6.1 so hopefully it should work.
Compute 2.0, 2.1 and 3.0 have been removed from the CUDA version being used so that's why it's 3.5 and up. You can use https://developer.nvidia.com/cuda-gpus to see what compute version your card uses.

urbydoo Jun 28, 2022

Getting a slightly different error, too low of a version.

[06/28/2022-15:38:55] [TRT] [I] Loaded engine size: 37 MiB
[06/28/2022-15:38:55] [TRT] [E] 6: The engine plan file is generated on an incompatible device, expecting compute 7.5 got compute 6.1, please rebuild.
[06/28/2022-15:38:55] [TRT] [E] 4: [runtime.cpp::deserializeCudaEngine::50] Error Code 4: Internal Error (Engine deserialization failed.)

kdill00 · 2022-06-28T22:25:37Z

kdill00
Jun 28, 2022

Yeah, this is absolutely phenomenal so far. I have used frigate 2 years, across 2 installations for 45 cameras and 7 tpu. I just replaced my home setup that was on two corals running 10 cameras and shifted to my quadro P2000. Im getting stellar performance with the yolov4 416 tiny. 4.8 ms or so. I am giddy as the model options are far wider. Next ill be deploying it at my business thats currently running 5 corals across 26 cameras. I just bought an nvidia A30 a few days ago to swap out the corals and move to use tensorrt on another platform. Very happy i can potentially stay with frigate. @blakeblackshear hopefully you can investigate see for yourself, and hopefully I dont speak to soon. This could give you far greater model flexibility as you move frigate+ forward.

Currently my quadro p2000 is showing about 6 to 8 percent of gpu memory being used per 1920p 5 to 7fps camera running detection. @yeahme49 If your ever in Oklahoma, I owe you a few brews.

1 reply

kdill00 Jun 28, 2022

and I meant to add that due to probably cpu, memory bottleknecks, as well as i have another quADRo decoding at my work, the 5 tpus have a rough time with 26, 8 fps cameras. im constantly tweaking motion, various things to reign in detection fps to try and get not waste those precious bounding boxes. This is what I hope have solved. Keeping all decoding, detection, ect on the card to stop these slow downs if possbile.

yeahme49 · 2022-06-28T23:41:22Z

yeahme49
Jun 28, 2022
Author

So it looks like the tensorrt engine will only work on the same generator of card as what i have (so anything pascal) and won't work with anything else. I'll try to make the image build the engine on first run on each machine so that it builds properly or have a second image that does the building of the engine and you can copy the files over. It may take a bit though.

So as of now it appears this will only work on pascal cards.

2 replies

kdill00 Jun 29, 2022

No worries. Appreciate everything so far.

arkadiy-telegin Jun 29, 2022

Thank you for the great work!

DarrenRiedlinger · 2022-07-01T15:03:00Z

DarrenRiedlinger
Jul 1, 2022

@yeahme49 Just wanted to reach out say thank you! I've been desperately waiting for a back-ordered Coral, but I've now got your image successfully running on my laptop with GeForce MX150 card. I still need to do some tuning on the detection, and am having issues with hwaccel (MX150 doesn't do NVENC; hoping I can eventually figure out a way I can pass in my other intel gpu and use vaapi for the decoding/encoding part), but the detection is successfully running in GPU now. Thanks so much!

1 reply

kdill00 Aug 2, 2022

you can specify the decoder per input. I was able to setup deploy to my GPU server, using an nvidia A30 to do detection as it also doesnt have nvenc for some reason only decoders. The gpu "device" number for detections seems to be coded into the layer file or trt model file generated when you first load up. From there just specify those hwaccel options in the config. I dont see why this wouldnt work as im using an P400 and P600 to decode 26 odd camera streams and then the A30 to run inference from that. It adds some over head as it goes from p400/p600 -> cpu > A30. Since its your cpu you may not see any added latency

mikegleasonjr · 2022-07-02T02:09:45Z

mikegleasonjr
Jul 2, 2022

Can I try it on a Turing card (GTX 1650)?

2 replies

yeahme49 Jul 2, 2022
Author

You can try it, it probably won't work though at this point

kdill00 Aug 2, 2022

I have it working on an nvidia A30. which is the next generation of the turing series.

QBANIN · 2022-07-04T19:54:23Z

QBANIN
Jul 4, 2022

How to rebuild ? :)

[TRT] [E] 6: The engine plan file is generated on an incompatible device, expecting compute 7.5 got compute 6.1, please rebuild.

0 replies

QBANIN · 2022-07-07T10:03:23Z

QBANIN
Jul 7, 2022

@yeahme49 would you please post some instructions how to rebuild against compute 7.5 from your github? I tried "docker build" but got too many errors. :(

0 replies

yeahme49 · 2022-07-08T18:01:53Z

yeahme49
Jul 8, 2022
Author

If you pull the latest image, it should now compile the models at first run if they don't exist so it should work on any system now.

A couple things are needed, in docker-compose.yml you need a path for the models so they get retained.

volumes:
    - /path/to/yolo4/directory:/yolo4

model in config.yml should look like

model:
  path: /yolo4/yolov4-tiny-416.trt
  labelmap_path: /labelmap.txt
  width: 416
  height: 416

or

model:
  path: /yolo4/yolov4-tiny-288.trt
  labelmap_path: /labelmap.txt
  width: 288
  height: 288

if you want to use the smaller 288 model

detectors:
  cuda:
    type: tensorrt

needs to be in config.yml also that it uses tensorrt

Let me know if this works, I tested it on my system and it worked but I only have my one card to test with, but I believe it should work for any nvidia card.

14 replies

yeahme49 Jul 9, 2022
Author

@yeahme49 BTW how to load test image to frigate?

https://docs.frigate.video/contributing#setup
I used that to have it load an mp4 file. I used ffmpeg to convert a jpg image into a 30 second mp4 file.

@yeahme49 This is my config https://pastebin.com/XTs5RZjw
Maybe you need to play with the min_score, threshold and min_area for objects possibly? My config doesn't change those from default at all. Not sure if that is causing issues with this different model or not.

QBANIN Jul 9, 2022

@yeahme49 BTW how to load test image to frigate?

https://docs.frigate.video/contributing#setup I used that to have it load an mp4 file. I used ffmpeg to convert a jpg image into a 30 second mp4 file.

@yeahme49 This is my config https://pastebin.com/XTs5RZjw
Maybe you need to play with the min_score, threshold and min_area for objects possibly? My config doesn't change those from default at all. Not sure if that is causing issues with this different model or not.

So, yolov4-tiny-416.trt works as expected (can detect all types of objects) with random HQ mp4 file downloaded from the internet but does not (I don't know why it works well with person detection only even if its far and blurry) with my not-so-high-quality HD camera unlike tflite Coral model bundled with frigate. Where can I download or how can I build better model like a non-tiny yolov4 or similar?

DarrenRiedlinger Jul 10, 2022

2GB VRAM. Sounds like the FP16 warnings can be ignored, not sure if the 2GB VRAM explains the cuMemcpyHtoDAsync failed error with the current docker image?
Thanks

My card has 2GB as well so that shouldn't be the problem. I'll have to do more looking into the issue and see what I can come up with.

Thanks @yeahme49. Any chance you still have and can share the previous docker image? For the time being I was hoping to roll back to the last working version, but didn't realize it was no longer on dockerhub when upgrading.

yeahme49 Jul 10, 2022
Author

I'm pushing a new image to docker hub that will allow disabling FP16 during compiling models, even though mine compiles fine with those errors we'll see if allowing you to disable FP16 will help. If not, I'll upload the compiled models from the old image that work on pascal.

DarrenRiedlinger Jul 12, 2022

I'm pushing a new image to docker hub that will allow disabling FP16 during compiling models, even though mine compiles fine with those errors we'll see if allowing you to disable FP16 will help. If not, I'll upload the compiled models from the old image that work on pascal.

Oddly, with the newest image even with the default yolov4-tiny-416 and FP16=true it's working for me. Haven't had a chance to play with the other models or see if there's an effect of disabling FP16, it looks like I'm back in business. Thanks!

QBANIN · 2022-07-08T19:12:22Z

QBANIN
Jul 8, 2022

@yeahme49 any chance to have 0.10-stable image with these features included?

1 reply

yeahme49 Jul 8, 2022
Author

Sorry only 0.11

IronBeardKnight · 2022-11-14T22:10:38Z

IronBeardKnight
Nov 14, 2022

Running on unraid from dockers 3060ti and working fantasticly. love your work. Get Outlook for Android<https://aka.ms/AAb9ysg>

…

________________________________ From: nickp27 ***@***.***> Sent: Tuesday, November 15, 2022 8:07:31 AM To: blakeblackshear/frigate ***@***.***> Cc: IronBeardKnight ***@***.***>; Comment ***@***.***> Subject: Re: [blakeblackshear/frigate] TensorRT using Nvidia GPU (Discussion #3016) Has the docker file for the tensorrt build been taken down? — Reply to this email directly, view it on GitHub<#3016 (comment)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/ANU2ZQF5LKUXQGJDKSZ6YI3WIKZ2HANCNFSM5RVRZWOQ>. You are receiving this because you commented.Message ID: ***@***.***>

0 replies

reidprichard · 2022-12-20T19:28:38Z

reidprichard
Dec 20, 2022

Very cool! As a Docker noob, I was surprised how easy this was to install, just:

Change the image in frigate.yml from blakeblackshear/frigate:stable to yeahme49/frigatetensor:latest
Add /home/username/Documents/frigate/yolo4:/yolo4 to volumes (I also manually created the yolo4 directory; not sure if that was necessary)
Change my detector from cpu1: type: cpu to cuda: type: tensorrt
Add model: path: /yolo4/yolov4-tiny-416.trt labelmap_path: /labelmap.txt width: 416 height: 416 to the end of my config.yml file
And rerun the compose command!

I previously had inference times of 100-200ms with 2 CPUs; now it's ~10ms.

0 replies

phabibjr · 2022-12-21T04:51:43Z

phabibjr
Dec 21, 2022

Absolutely love this. Thank you so much for giving us this option. My Coral TPUs have been on backorder since April and I just happened across this project while researching GPU decoding for Frigate. I fired up a new VM to test it out and was very impressed, so much so that I promptly shut down the Frigate VM that was using the cpu detectors.

I'm currently trying to push Frigate to its limits. I have 15 cameras at 1080p or higher, detect is using the primary camera feed instead of the lower res sub feed. I have a Tesla P4 running at normal clocks for detect and also h264 decoding. The gpu is only at about 15% but the decoder is at 100% with an inference speed of around 5.15 (it was over 100 with cpu detectors even with using sub feeds for detect).

A quick question. I have a second Tesla P4 and hopefully some Coral TPUs soon. Can this version of Frigate utilize multiple GPUs or a mix of a GPU and the Coral TPUs for detectors? I've tried defining two GPU detectors using the device flags under each detector (device: pci:0 and device: pci:1) but only one is utilized. Thank you again for all your hard work.

0 replies

reidprichard · 2022-12-21T05:18:35Z

reidprichard
Dec 21, 2022

@phabibjr, have you checked to see that both GPUs are being passed to your container with the nvidia-smi command? I think to run that in the container the command would be sudo docker exec -it frigate nvidia-smi. If using docker-compose I believe you have to add an additional line in your .yml specifying each GPU's ID per the documentation.

4 replies

phabibjr Dec 21, 2022

Yes. Both gpus show up inside the container.

phabibjr Dec 21, 2022

Compose defaults to using all gpus but I’ve tried it with and without the ‘count: all’ argument under gpu reservation. In both cases NVIDIA-smi shows both cards but only one being utilized. It could be that it only uses one until it reaches high utilization then starts utilizing the next card. I’m going to play around with ffmpeg tonight to see if I can split up the decoding load.

NickM-27 Dec 21, 2022
Collaborator Sponsor

When it comes to ffmpeg, a GPU id has to be specified if multiple are available https://arstech.net/choose-gpu-in-ffmpeg/

phabibjr Dec 21, 2022

Thank you. I thought I had seen something about that before, I’ll give the article a read tonight and see what kind of results I get with it.

GCV-Sleeper-Service · 2022-12-23T00:06:47Z

GCV-Sleeper-Service
Dec 23, 2022

I am sorry if I am asking a question that has been asked before, just want to understand the current status:

Is currently a way to run Frigate on Jetson Nano 4GB, using the full potential of NVIDA's 128 cores?

If the answer is 'yes' - can I take such a 100% working image from a repository (where?) and run it, so it will work?
If the answer is still 'yes, but you need to build it yourself', then what is the exact procedure that will work 100% from start to end?

If the answer is 'not yet', then what is the status of the work done, if any?

Please understand - I am asking specifically for the use case with Jetson Nano. I don't have any other GPU/TPU sitting around, but I do have one Jetson...

Thanks.

1 reply

Blu-Eagle Aug 29, 2023

read my post of today :)

damsport11 · 2022-12-24T10:23:50Z

damsport11
Dec 24, 2022

First, @yeahme49 thanks for making this available. I am looking forward very much using it, however, I run into a persisting error that i cannot overcome for days now....
Composing goes well and does not show any errors, however, when running the container I get:

frigatetensor | [s6-init] making user provided files available at /var/run/s6/etc...exited 0.
frigatetensor | [s6-init] ensuring user provided files have correct perms...exited 0.
frigatetensor | [fix-attrs.d] applying ownership & permissions fixes...
frigatetensor | [fix-attrs.d] done.
frigatetensor | [cont-init.d] executing container initialization scripts...
frigatetensor | [cont-init.d] done.
frigatetensor | [services.d] starting services
frigatetensor | [services.d] done.
frigatetensor | [2022-12-24 11:04:23] frigate.app INFO : Starting Frigate (0.11.0-tensor-20cf27a)
frigatetensor | Starting migrations
frigatetensor | [2022-12-24 11:04:23] peewee_migrate INFO : Starting migrations
frigatetensor | There is nothing to migrate
frigatetensor | [2022-12-24 11:04:23] peewee_migrate INFO : There is nothing to migrate
frigatetensor | [2022-12-24 11:04:23] detector.cuda INFO : Starting detection process: 225
frigatetensor | [2022-12-24 11:04:23] frigate.app INFO : Output process started: 227
frigatetensor | [2022-12-24 11:04:23] ws4py INFO : Using epoll
frigatetensor | [2022-12-24 11:04:23] frigate.app INFO : Camera processor started for Tuin: 232
frigatetensor | [2022-12-24 11:04:23] frigate.app INFO : Camera processor started for Front: 234
frigatetensor | [2022-12-24 11:04:23] frigate.app INFO : Capture process started for Tuin: 236
frigatetensor | [2022-12-24 11:04:23] frigate.app INFO : Capture process started for Front: 240
frigatetensor | [2022-12-24 11:04:23] ws4py INFO : Using epoll
frigatetensor | [2022-12-24 11:04:23] frigate.detection.tensorrt ERROR : ERROR: failed to load /yolo4/libyolo_layer.so. /yolo4/libyolo_layer.so: failed to map segment from shared object
frigatetensor | [12/24/2022-11:04:23] [TRT] [I] [MemUsageChange] Init CUDA: CPU +304, GPU +0, now: CPU 357, GPU 407 (MiB)
frigatetensor | [12/24/2022-11:04:23] [TRT] [I] Loaded engine size: 12 MiB
frigatetensor | [12/24/2022-11:04:23] [TRT] [E] 1: [pluginV2Runner.cpp::load::293] Error Code 1: Serialization (Serialization assertion creator failed.Cannot deserialize plugin since corresponding IPluginCreator not found in Plugin Registry)
frigatetensor | [12/24/2022-11:04:23] [TRT] [E] 4: [runtime.cpp::deserializeCudaEngine::50] Error Code 4: Internal Error (Engine deserialization failed.)
frigatetensor | Process detector:cuda:
frigatetensor | Traceback (most recent call last):
frigatetensor | File "/usr/local/lib/python3.9/multiprocessing/process.py", line 315, in _bootstrap
frigatetensor | self.run()
frigatetensor | File "/usr/local/lib/python3.9/multiprocessing/process.py", line 108, in run
frigatetensor | self._target(*self._args, **self._kwargs)
frigatetensor | File "/opt/frigate/frigate/detection/init.py", line 86, in run_detector
frigatetensor | object_detector = object_detector_factory()
frigatetensor | File "/opt/frigate/frigate/detection/init.py", line 49, in _detector_factory
frigatetensor | object_detector = module.object_detector_factory(detector_config, model_path)
frigatetensor | File "/opt/frigate/frigate/detection/tensorrt.py", line 21, in object_detector_factory
frigatetensor | return LocalObjectDetector(detector_config, model_path)
frigatetensor | File "/opt/frigate/frigate/detection/tensorrt.py", line 126, in init
frigatetensor | self.input_shape = self._get_input_shape()
frigatetensor | File "/opt/frigate/frigate/detection/tensorrt.py", line 45, in _get_input_shape
frigatetensor | binding = self.engine[0]
frigatetensor | TypeError: 'NoneType' object is not subscriptable
frigatetensor | Exception ignored in: <function LocalObjectDetector.del at 0x7fd550570af0>
frigatetensor | Traceback (most recent call last):
frigatetensor | File "/opt/frigate/frigate/detection/tensorrt.py", line 145, in del
frigatetensor | del self.outputs
frigatetensor | AttributeError: outputs

the yolo4 directery does get filled up as expected and contains the required files. Any help would be highly appreciated ;-)
Enjoy X-mas!

0 replies

opnordahl · 2023-01-28T19:59:23Z

opnordahl
Jan 28, 2023

I am a Linux noob, but I finally managed to get this working on a Proxmox with in a docker container on a Ubuntu 22.04 LCX with Nvidia 515.86 driver (with CUDA 11.7) for Nvidia Geforce 970 card.

I used this guide to install the Nvidia drivers on host and LXC: https://gist.github.com/egg82/90164a31db6b71d36fa4f4056bbee2eb

And then this guide to install Nvidia container tooltkit: https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/install-guide.html#install-guide

Ran into some strange error messages when running the frigatetensor docker. This line fixed it:
sudo sed -i 's/^#no-cgroups = false/no-cgroups = true/;' /etc/nvidia-container-runtime/config.toml

Also had to define path:
volumes:
- /path/to/yolo4/directory:/yolo4

0 replies

LordNex · 2023-01-29T01:46:32Z

LordNex
Jan 29, 2023

Yea I don’t think the problem is with the GPU at the point. It’s now getting it to run on other platforms such as ARM and such. Personally I want to run CompreFace on a Jetson Nano and have Frigate on a VMWare Install of Ubuntu Server LTS with both a Nvidia card passed through for transcoding, and a USB Coral to handle the object detection. Then via MQTT pass that info to Double Take on my Home Assistant install which will send it to the Nano for Facial Recognition and Verification On Jan 28, 2023, at 1:59 PM, opnordahl ***@***.***> wrote: I am a Linux noob, but I finally managed to get this working on a Proxmox with in a docker container on a Ubuntu 22.04 LCX with Nvidia 515.86 driver (with CUDA 11.7) for Nvidia Geforce 970 card. I used this guide to install the Nvidia drivers on host and LXC: https://gist.github.com/egg82/90164a31db6b71d36fa4f4056bbee2eb And then this guide to install Nvidia container tooltkit: https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/install-guide.html#install-guide Ran into some strange error messages when running the frigatetensor docker. This line fixed it: sudo sed -i 's/^#no-cgroups = false/no-cgroups = true/;' /etc/nvidia-container-runtime/config.toml Also had to define path: volumes: - /path/to/yolo4/directory:/yolo4 — Reply to this email directly, view it on GitHub<#3016 (comment)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/ACGXNJHVPUN4A6QP4ZBNETLWUV3CPANCNFSM5RVRZWOQ>. You are receiving this because you commented.Message ID: ***@***.***>

0 replies

kajer · 2023-01-30T21:16:05Z

kajer
Jan 30, 2023

I deployed yeahme49/frigatetensor:latest on a TrueNAS SCALE box with ~20TB of storage and a Quadro A4000 I have ~8 4K cameras running with the default built model files and killer stability. @yeahme49 - If you are a SF / Seattle adjacent, I owe you beer.

I wrote a small how to for getting this container working on SCALE, and posted to /r/homelab and /r/frigate but was promptly ignored for including too many links in my guide :(

Question, Are there plans to include changes like .12 where MQTT becomes optional?

3 replies

NickM-27 Jan 30, 2023
Collaborator Sponsor

Question, Are there plans to include changes like .12 where MQTT becomes optional?

0.12 supports tensorrt directly, is there any reason it is not sufficient as is?

kajer Jan 30, 2023

I don't keep entirely up to date on frigate, this is good to know that official .12 supports TensorRT. Next time I don maintenance I will check it out. I have a combo of 3MP and 5MP RTSP cams, so I may be quite close to the memory limits of my personal QuadroP400 card

dinezttv Feb 24, 2024

Would love to see the guide for truenas scale.
Struggling a lot to get tensorrt running on frigate.

paulmro · 2023-02-05T19:58:08Z

paulmro
Feb 5, 2023

Hy all, is there any tutorial to achieve this that can be followed by noobs like me :D .. I nearly broken my HA server 2 times already trying to follow you

1 reply

NickM-27 Feb 5, 2023
Collaborator Sponsor

This is officially supported in frigate 0.12 which has documentation up

LordNex · 2023-02-05T21:01:35Z

LordNex
Feb 5, 2023

Is there a way to try out a beta or RC? On Feb 5, 2023, at 2:31 PM, Nicolas Mowen ***@***.***> wrote: This is officially supported in frigate 0.12 which has documentation up — Reply to this email directly, view it on GitHub<#3016 (reply in thread)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/ACGXNJBR34ILUTFRJSIRWF3WWAE2BANCNFSM5RVRZWOQ>. You are receiving this because you commented.Message ID: ***@***.***>

2 replies

NickM-27 Feb 5, 2023
Collaborator Sponsor

@LordNex yes of course https://github.com/blakeblackshear/frigate/releases/tag/v0.12.0-beta7

LordNex Feb 5, 2023

@LordNex yes of course https://github.com/blakeblackshear/frigate/releases/tag/v0.12.0-beta7

@NickM-27 Wow!

LordNex · 2023-02-05T21:15:23Z

LordNex
Feb 5, 2023

And I’ve gotten a bit closer to testing it out on my Nano. I have it disassembled, now I just gotta flash the card with I’m guessing the standard Nano SDK and start adding dependencies. I’ll let you know how that goes. I’m also thinking of grabbing one of my old video cards and slapping it into the PowerEdge R620. Since Frigate is already running there fine with the Coral, I’ll probably try and pass it through to my Home Assistant VM where I have CompreFace installed or do a new build of Ubuntu Server and let CompreFace run inside its own VM with the GPU passed through to it. Then I’ll benchmark what runs Frigate best. The Nano with Coral and CompreFace on the R620. Or ditch the Nano all together and run Frigate in one VM with a Coral, another VM with CompreFace and a GPU passed through, and DoubleTake on Home Assistant to tie everything together. Lord_Nex On Feb 5, 2023, at 2:31 PM, Nicolas Mowen ***@***.***> wrote: This is officially supported in frigate 0.12 which has documentation up — Reply to this email directly, view it on GitHub<#3016 (reply in thread)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/ACGXNJBR34ILUTFRJSIRWF3WWAE2BANCNFSM5RVRZWOQ>. You are receiving this because you commented.Message ID: ***@***.***>

0 replies

LordNex · 2023-02-05T21:58:47Z

LordNex
Feb 5, 2023

@NickM-27 When it's trying to connect to MQTT it's now failing. It wasn't previously and the only setting I changes was the image name. I still need to troubleshoot a few things on the MQTT end but like I said. It was connected and working with those same setting under 0.11.1

Also, when I goto Config and attempt to make a change and save I receive

"Could not write config file, be sure that Frigate has write permission on the config file."

10 replies

LordNex Feb 6, 2023

jsmpeg should continue working for the frigate card as is. However, you could instead enable birdseye restream and use the RTSP based restream with the camera entity instead

@NickM-27 Ok so I went through and added the new settings for birdseye to my config changing only the "mode:" value to "continuous". I've rebooted both HA with the Frigate Proxy Beta and Frigate but do not see any new "birdseye" camera entity such as "camera.frigate_birdseye" or something.

If I was to add the RSTP restream directly to the card; where and what would I put in the card entity? I'd prefer to keep using the Frigate Card with its integrations and not resort to adding some other card just to get the RSTP stream.

NickM-27 Feb 6, 2023
Collaborator Sponsor

jsmpeg should continue working for the frigate card as is. However, you could instead enable birdseye restream and use the RTSP based restream with the camera entity instead

@NickM-27 Ok so I went through and added the new settings for birdseye to my config changing only the "mode:" value to "continuous". I've rebooted both HA with the Frigate Proxy Beta and Frigate but do not see any new "birdseye" camera entity such as "camera.frigate_birdseye" or something.

If I was to add the RSTP restream directly to the card; where and what would I put in the card entity? I'd prefer to keep using the Frigate Card with its integrations and not resort to adding some other card just to get the RSTP stream.

I'd suggest reading the docs, because that mode is unrelated to restream

https://deploy-preview-4055--frigate-docs.netlify.app/configuration/restream

NickM-27 Feb 6, 2023
Collaborator Sponsor

Also, have you updated to the beta version of the integration? that is required as well

LordNex Feb 6, 2023

Also, have you updated to the beta version of the integration? that is required as well

I believe so. I upgraded it when I went to 0.11.1. Is there a different one that what I'm using below?

NickM-27 Feb 6, 2023
Collaborator Sponsor

I would highly suggest reading the release notes before updating to a release. If nothing else at least read the breaking changes section which is conveniently at the top of the release notes. The first item listed there:

If using the Frigate-HomeAssistant Integration it will need to be updated to 4.0.0 beta01 to avoid breaking changes with Frigate 0.12

olyashok · 2023-02-06T21:32:25Z

olyashok
Feb 6, 2023

folks - very excited to discover frigate/nvidia thread.

i started few years ago running deepstack then migrated to torchserve then migrated to nvidia triton inference server, used yolo v4/v5/v7 (soon to migrate to v8) all pushing notifications from 14 cameras to home assistant.

few questions:

anyone working on nvidia triton inference server detector - this would simplify things A LOT by decoupling inference
anyone working on nvidia deepstream integration
is there a simple way to bypass ffmpeg - i.e. just poll frames directly from camera if camera supports snapshots (to lower decoding load)?

1 reply

klutzzykitty Mar 19, 2023

@olyashok
I was planning on giving this a try.
Triton inference along with deepstream will drastically change the amount of compute utilised by the AI module.

Any discussions are most welcome !

klutzzykitty · 2023-03-19T11:29:39Z

klutzzykitty
Mar 19, 2023

@yeahme49

That's excellent work with getting frigate running on Nvidia GPUs.

Lately I saw that the docker image has been removed from your docker hub repo.
Although you mention there that its better to use V0.12 of frigate, it would be good to have access to your docker image.
Is there any way to share it ?

Secondly, are there any instructions on how to build the tensorrt docker image which you created ?
I don't know whether the instructions in the description i.e.

I started by using the watsor dockerfile (https://github.com/asmirnou/watsor/blob/master/docker/Dockerfile.base) but updating it for ubuntu 20.04: https://pastebin.com/ERhsFwEi
and then the watsor gpu dockerfile (https://github.com/asmirnou/watsor/blob/master/docker/Dockerfile.gpu.base) using my new local image: https://pastebin.com/HnBpHWJX

Then I modified the yolo4 converter files for amd64 instead of arm.
Dockerfile: https://pastebin.com/iHbGPR9Z
build.sh: https://pastebin.com/SnA9gibi
assets/install_protobuf.sh: https://pastebin.com/xHfdG3V8
assets/run.sh: https://pastebin.com/XPC0qmCe

have been updated.
It would be fantastic to have a brief run down on what to execute and what to change, in-order to build a tensorrt image.
(Although V0.12 exists, one with only tensorrt like this will also be nice).

Is it available on your frigate fork?
I see that it's a much more latest version that's in your fork of frigate compared to the discussions in this thread.

Thank you !!

0 replies

TensorRT using Nvidia GPU #3016

Replies: 66 comments · 94 replies

yeahme49 Apr 21, 2022 Author

yeahme49 Jun 23, 2022 Author

NickM-27 Dec 26, 2022 Collaborator Sponsor

yeahme49 Jun 23, 2022 Author

yeahme49 Jun 23, 2022 Author

NickM-27 Jul 12, 2022 Collaborator Sponsor

yeahme49 Jun 26, 2022 Author

yeahme49 Jun 28, 2022 Author

yeahme49 Jul 2, 2022 Author

yeahme49 Jul 8, 2022 Author

yeahme49 Jul 9, 2022 Author

yeahme49 Jul 10, 2022 Author

yeahme49 Jul 8, 2022 Author

NickM-27 Dec 21, 2022 Collaborator Sponsor

NickM-27 Jan 30, 2023 Collaborator Sponsor

NickM-27 Feb 5, 2023 Collaborator Sponsor

Replies: 66 comments 94 replies

yeahme49 Apr 21, 2022
Author

yeahme49
Jun 23, 2022
Author

NickM-27 Dec 26, 2022
Collaborator Sponsor

yeahme49
Jun 23, 2022
Author

yeahme49 Jun 23, 2022
Author

NickM-27 Jul 12, 2022
Collaborator Sponsor

yeahme49 Jun 26, 2022
Author

yeahme49
Jun 28, 2022
Author

yeahme49 Jul 2, 2022
Author

yeahme49
Jul 8, 2022
Author

yeahme49 Jul 9, 2022
Author

yeahme49 Jul 10, 2022
Author

yeahme49 Jul 8, 2022
Author

NickM-27 Dec 21, 2022
Collaborator Sponsor

NickM-27 Jan 30, 2023
Collaborator Sponsor

NickM-27 Feb 5, 2023
Collaborator Sponsor