Replies: 6 comments 2 replies
-
Running on recordings is a lot more complicated because you need an image to send. Also, needs something to be searchable. Could be manual event but that presents additional challenges and I'm not sure what the trigger would be. Perhaps the manual event API could have a use genai param but not sure if it would work well. |
Beta Was this translation helpful? Give feedback.
-
Once the motion recording has ended (been created). we can sample a still image every gap seconds using ffmpeg like amblegpt does and summarize the scene. There are other models like Florence 2 that do a great job of drawing bounding boxes on various detected objects within the snapshot scene to make the metadata quite rich. As a start, we can pick a point where there is motion, possibly the midpoint of the recording as the reference snapshot Just a suggestion. This can of course be refined |
Beta Was this translation helpful? Give feedback.
-
Just a sample snippet from amblegpt
|
Beta Was this translation helpful? Give feedback.
-
Just farm out the detection/summarization to something like openedai-vision or similar |
Beta Was this translation helpful? Give feedback.
-
Also worth mentioning that all of the work frigate does currently is available to APIs, so what you're suggesting could be built as an external sidecar service that doesn't require changes in frigate itself. |
Beta Was this translation helpful? Give feedback.
-
I agree all this is possible in an external side car like nodered. I like frigate for it's clip management and centralized store , camera management and API. Let's see how people end up using semantic tagging and search and how things unfurl |
Beta Was this translation helpful? Give feedback.
-
@hunterjm and @NickM-27
I just tried the semantic genai tagging enhancement using the 0.14.1 branch . Great start!
Even when frigate is used purely to record a busy street scene where object detection is disabled since it has a tendency to spike the CPU to uncontrollable levels, it makes sense to summarize a recording by creating a manual event and adding the applicable metadata. So it is searchable on a timeline
Thoughts?
Beta Was this translation helpful? Give feedback.
All reactions