Utilizing Inline Motion Vector Estimates from an H.264 Encoder to Implement Motion Detection

or Motion Detection Made Easy


A Brief Introduction to Video Encoding

  • Video is just lots of images
  • Simple encoders just encode lots of images (MJPEG):

    JPEG JPEG JPEG JPEG JPEG JPEG JPEG JPEG
  • Complex encoders exploit similarity of images (MPEG):

    I-Frame P-Frame B-Frame B-Frame P-Frame B-Frame

MPEG I-Frames

Simplest frame type: it's just a complete picture. Also known as a key frame.

MPEG P-Frames

Stores only what changed since the previous frame. Useful!

MPEG B-Frames

Like P-Frames, but with time travel: they can reference both earlier and later frames.

Motion Estimation

Every frame, the GPU calculates which macro-blocks moved where.

Limitations

  • Very crude estimate
  • Doesn't try to identify features
  • Based solely on similarity of blocks
  • 2D vector only, doesn't attempt to infer rotation, Z-motion, etc.

Interpretation

What on earth is this?!

The Data!

For each frame, the Pi's camera produces a stream of inline motion data. It consists of an X, Y, and SAD (Sum of Absolute Differences) value for each 16×16-pixel macro-block:

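As a sketch (assuming the documented layout and a 640×480 mode), the per-block record maps neatly onto a NumPy dtype:

```python
import numpy as np

# One record per 16x16 macro-block: signed X/Y vector components
# (in pixels) plus an unsigned 16-bit SAD (Sum of Absolute
# Differences) value indicating how well the block matched.
motion_dtype = np.dtype([
    ('x',   np.int8),
    ('y',   np.int8),
    ('sad', np.uint16),
])

# At 640x480 that's 30 rows of blocks, and one extra column
# beyond the 40 you'd expect, so 41 columns per row.
width, height = 640, 480
cols = (width // 16) + 1
rows = height // 16
print(motion_dtype.itemsize, rows, cols)  # 4 30 41
```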

The Code!

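One way to handle the stream by hand, sketched with NumPy (a zeroed buffer stands in here for a real capture, e.g. one written by raspivid -x motion.data alongside the video):

```python
import numpy as np

# The "tedious" way: parse the raw motion stream yourself.
# One 4-byte record per macro-block, rows * cols records per frame.
motion_dtype = np.dtype([('x', np.int8), ('y', np.int8), ('sad', np.uint16)])
width, height = 640, 480
cols = (width // 16) + 1
rows = height // 16

# Two frames of zeroed records stand in for the real file's bytes
raw = np.zeros(2 * rows * cols, dtype=motion_dtype).tobytes()

# One (rows, cols) record array per frame
frames = np.frombuffer(raw, dtype=motion_dtype).reshape((-1, rows, cols))
print(frames.shape)  # (2, 30, 41)
```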

The Code!

The picamera library does most of the tedious stuff (since 1.5)…

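A minimal sketch of the picamera route (needs a Pi with a camera module attached; the filenames are arbitrary):

```python
import picamera

with picamera.PiCamera() as camera:
    camera.resolution = (640, 480)
    camera.framerate = 30
    # Since picamera 1.5, motion_output writes the inline motion
    # vectors to a second output while the H.264 video goes to the first
    camera.start_recording('video.h264', motion_output='motion.data')
    camera.wait_recording(10)
    camera.stop_recording()
```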

How to detect motion

Crude first steps...

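A possible shape for those first steps, sketched in plain NumPy so it runs anywhere (the 60-pixel and 10-block thresholds are arbitrary guesses to tune per scene):

```python
import numpy as np

# Crude rule of thumb: call it "motion" when more than `count`
# macro-blocks have a vector magnitude above `threshold` pixels
def detect_motion(a, threshold=60, count=10):
    mag = np.sqrt(a['x'].astype(float) ** 2 + a['y'].astype(float) ** 2)
    return bool((mag > threshold).sum() > count)

# Synthetic frames standing in for real motion data (30x41 blocks
# for 640x480): one still, one with a patch of blocks shifted right
motion_dtype = np.dtype([('x', np.int8), ('y', np.int8), ('sad', np.uint16)])
still = np.zeros((30, 41), dtype=motion_dtype)
moving = still.copy()
moving['x'][:12, :12] = 100
print(detect_motion(still), detect_motion(moving))  # False True
```

On a Pi, the same check could live in the analyse() method of a picamera.array.PiMotionAnalysis subclass, which hands you each frame's vectors as exactly this kind of record array.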

Refinements

How about a histogram of magnitudes?

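A sketch of that idea, again with plain NumPy (the bin count of 8 is arbitrary; the upper range follows from signed-byte components, sqrt(2) × 128 ≈ 181):

```python
import numpy as np

# Bucket every block's vector magnitude; a "still" frame piles up
# in the first bin, real motion puts mass in the higher bins
def magnitude_histogram(a, bins=8):
    mag = np.sqrt(a['x'].astype(float) ** 2 + a['y'].astype(float) ** 2)
    hist, _ = np.histogram(mag, bins=bins, range=(0, 181))
    return hist

motion_dtype = np.dtype([('x', np.int8), ('y', np.int8), ('sad', np.uint16)])
a = np.zeros((30, 41), dtype=motion_dtype)
a['y'][:5, :5] = 50          # 25 blocks moving down by 50 pixels
hist = magnitude_histogram(a)
print(hist)                  # 25 blocks land in the ~45-68 bin
```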

Future Work

Left as an exercise for the reader...

  • Use SAD values to filter good blocks
  • Define features in motion (convex hull? feature labelling?)
    • to ignore lighting effects
    • ... and cats
  • Combine with image data (e.g. ignore certain colours?)
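As a hedged starting point for the first bullet: a low SAD means the encoder found a good match for a block, so its vector is more trustworthy. A sketch that zeroes out badly-matched vectors (the median cut-off is a placeholder, not a recommendation):

```python
import numpy as np

# Keep a block's vector magnitude only if its SAD is at or below
# the frame's median SAD; treat everything else as noise
def filter_by_sad(a):
    mag = np.sqrt(a['x'].astype(float) ** 2 + a['y'].astype(float) ** 2)
    return np.where(a['sad'] <= np.median(a['sad']), mag, 0.0)

motion_dtype = np.dtype([('x', np.int8), ('y', np.int8), ('sad', np.uint16)])
a = np.zeros((30, 41), dtype=motion_dtype)
a['x'][:] = 10               # every block apparently moving right
a['sad'][0, 0] = 5000        # ...but one block matched very badly
mag = filter_by_sad(a)
print(mag[0, 0], mag[1, 1])  # 0.0 10.0
```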

Thank You!

The picamera library: picamera.readthedocs.org

The author: dave@waveform.org.uk

The Twitter feed: @waveform80

Obligatory Cat Video

Because Fiona insists!

Now go play with this and make something interesting!