Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

USB Host UVC driver version 2 #84

Merged
merged 2 commits into from
Dec 19, 2024
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
10 changes: 9 additions & 1 deletion .build-test-rules.yml
Original file line number Diff line number Diff line change
Expand Up @@ -6,15 +6,23 @@ host/class:
enable:
- if: SOC_USB_OTG_SUPPORTED == 1

host/class/uvc/usb_host_uvc/examples/camera_display:
enable:
- if: (IDF_VERSION_MAJOR >= 5 and IDF_VERSION_MINOR >= 3)
reason: This example uses esp_lcd API introduced in v5.3

# Host tests
host/class/cdc/usb_host_cdc_acm/host_test:
enable:
- if: IDF_TARGET in ["linux"] and (IDF_VERSION_MAJOR >= 5 and IDF_VERSION_MINOR >= 4)


host/class/hid/usb_host_hid/host_test:
enable:
- if: IDF_TARGET in ["linux"] and (IDF_VERSION_MAJOR >= 5 and IDF_VERSION_MINOR >= 4)
reason: USB mock was added in v5.4

host/class/uvc/usb_host_uvc/host_test:
enable:
- if: IDF_TARGET in ["linux"] and (IDF_VERSION_MAJOR >= 5 and IDF_VERSION_MINOR >= 4)
- if: IDF_TARGET in ["linux"] and (IDF_VERSION_MAJOR >= 5 and IDF_VERSION_MINOR >= 4)
reason: USB mock was added in v5.4
3 changes: 0 additions & 3 deletions .gitmodules
Original file line number Diff line number Diff line change
@@ -1,3 +0,0 @@
[submodule "host/usb_host_uvc/libuvc"]
path = host/class/uvc/usb_host_uvc/libuvc
url = ../../espressif/libuvc.git
5 changes: 5 additions & 0 deletions host/class/uvc/usb_host_uvc/CHANGELOG.md
Original file line number Diff line number Diff line change
@@ -1,3 +1,8 @@
## 2.0.0

- New version of the driver, native to Espressif's USB Host Library
- Removed libuvc dependency

## 1.0.4

- Support printf frame based descriptor
Expand Down
41 changes: 13 additions & 28 deletions host/class/uvc/usb_host_uvc/CMakeLists.txt
Original file line number Diff line number Diff line change
@@ -1,28 +1,13 @@
configure_file(${CMAKE_CURRENT_LIST_DIR}/libuvc/include/libuvc/libuvc_config.h.in
${CMAKE_CURRENT_BINARY_DIR}/include/libuvc/libuvc_config.h
@ONLY)

set(LIBUVC_SOURCES libuvc/src/ctrl.c
libuvc/src/ctrl-gen.c
libuvc/src/device.c
libuvc/src/diag.c
libuvc/src/frame.c
libuvc/src/init.c
libuvc/src/misc.c
libuvc/src/stream.c)

idf_component_register(
SRCS ${LIBUVC_SOURCES} src/descriptor.c src/libusb_adapter.c
INCLUDE_DIRS include libuvc/include
PRIV_INCLUDE_DIRS private_include
REQUIRES usb pthread)

set_source_files_properties(
${CMAKE_CURRENT_LIST_DIR}/libuvc/src/device.c PROPERTIES COMPILE_FLAGS -Wno-implicit-fallthrough)
set_source_files_properties(
${CMAKE_CURRENT_LIST_DIR}/libuvc/src/stream.c PROPERTIES COMPILE_FLAGS -Wno-unused-variable)
set_source_files_properties(
${CMAKE_CURRENT_LIST_DIR}/libuvc/src/diag.c PROPERTIES COMPILE_FLAGS -Wno-format)

target_compile_definitions(${COMPONENT_LIB} PRIVATE LIBUVC_NUM_TRANSFER_BUFS=4)
target_include_directories(${COMPONENT_LIB} PUBLIC ${CMAKE_CURRENT_BINARY_DIR}/include/)
idf_component_register(SRCS
"uvc_host.c"
"uvc_descriptor_parsing.c"
"uvc_descriptor_printing.c"
"uvc_frame.c"
"uvc_control.c"
"uvc_isoc.c"
"uvc_bulk.c"
INCLUDE_DIRS include
PRIV_INCLUDE_DIRS private_include include/esp_private
PRIV_REQUIRES heap
REQUIRES usb
)
Original file line number Diff line number Diff line change
@@ -1,3 +1,4 @@

Apache License
Version 2.0, January 2004
http://www.apache.org/licenses/
Expand Down
34 changes: 18 additions & 16 deletions host/class/uvc/usb_host_uvc/README.md
Original file line number Diff line number Diff line change
@@ -1,24 +1,26 @@
# USB Host UVC Driver
# USB Host UVC Class Driver

[![Component Registry](https://components.espressif.com/components/espressif/usb_host_uvc/badge.svg)](https://components.espressif.com/components/espressif/usb_host_uvc)
![maintenance-status](https://img.shields.io/badge/maintenance-experimental-blue.svg)

This directory contains USB host UVC driver based on [libuvc](https://github.com/libuvc/libuvc) library. Support for `libuvc` is achieved by implementing adapter between [libusb](https://github.com/libusb/libusb) (underling host library used by `libuvc`) and [usb_host](https://docs.espressif.com/projects/esp-idf/en/latest/esp32s2/api-reference/peripherals/usb_host.html) library targeted for ESP SOCs.
This component contains an implementation of a USB Host UVC Class Driver that is implemented on top of the [USB Host Library](https://docs.espressif.com/projects/esp-idf/en/latest/esp32s2/api-reference/peripherals/usb_host.html).

## Usage
The UVC driver allows video streaming from USB cameras.

Reference [uvc_host_example](https://github.com/espressif/esp-idf/tree/master/examples/peripherals/usb/host/uvc) is similar to one found in `libuvc` repository with few additions:
1. Before calling `uvc_init()`, `initialize_usb_host_lib()` has to be called in order to initialize usb host library.
2. Since `libuvc` selects highest possible `dwMaxPayloadTransferSize` by default, user has to manually overwrite obatained value to 512 bytes (maximum transfer size supported by ESP32-S2/S3) before passing it to `uvc_print_stream_ctrl()` function.
3. Optionally, user can configure `libusb adapter` by passing appropriate parameters to `libuvc_adapter_set_config()`.
### Features
- Isochronous and Bulk transfers streaming
- Multiple video streams
- Frame buffers in PSRAM
- Video Stream format negotiation
- Stream overflow and underflow management

## Known limitations
### Usage

Having only Full Speed USB peripheral and hardware limited MPS (maximum packet size) to 512 bytes, ESP32-S2/S3 is capable of reading about 0.5 MB of data per second. When connected to Full Speed USB host, cameras normally provide resolution no larger than 640x480 pixels.
Following two supported formats are the most common (both encoded in MJPEG):
* 320x240 30 FPS
* 640x480 15 FPS
Following sequence diagram represents public API usage of the UVC driver. New frames are passed to user in a callback. This design offers flexible interface upon which more complex frame processing components can be built.

## Tested cameras
* Logitech C980
* CANYON CNE-CWC2
* Logitech C270
![UVC public API](docs/uvc_public_api.png)

### Additional information
- [Frequently Asked Questions](docs/FAQ.md)
- [Examples](examples/)
- [Architectural notes](docs/arch_notes.md)
88 changes: 88 additions & 0 deletions host/class/uvc/usb_host_uvc/docs/FAQ.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,88 @@
## Frame buffer size

<details>
<summary>Q1: My camera reports a `dwMaxVideoFrameSize` that is unnecessarily large during format negotiation, causing excessive RAM usage. How can I reduce this?</summary>

You can limit RAM usage during UVC stream opening by configuring the frame buffer size in the `uvc_host_stream_config_t.advanced.frame_size` parameter of the uvc_host_stream_open call. The table below provides frame sizes for typical video resolutions and standard MJPEG compression ratios, which can help you choose an appropriate size.

> **Note**: If you configure the frame buffer size smaller than the reported `dwMaxVideoFrameSize`, you might occasionally encounter the `UVC_HOST_FRAME_BUFFER_OVERFLOW` event. This indicates that the actual received frame exceeded the allocated buffer size.


| Resolution | Width (px) | Height (px) | Uncompressed Size (kB) | Compressed Size (10:1) (kB) | Compressed Size (15:1) (kB) | Compressed Size (20:1) (kB) |
|--------------------|------------|-------------|------------------------|----------------------------|-----------------------------|-----------------------------|
| QQVGA | 160 | 120 | 56.25 | 5.63 | 3.75 | 2.81 |
| HQVGA | 240 | 160 | 112.50 | 11.25 | 7.50 | 5.63 |
| QVGA | 320 | 240 | 225.00 | 22.50 | 15.00 | 11.25 |
| VGA | 640 | 480 | 900.00 | 90.00 | 60.00 | 45.00 |
| SVGA | 800 | 600 | 1406.25 | 140.63 | 93.75 | 70.31 |
| XGA | 1,024 | 768 | 2304.00 | 230.40 | 153.60 | 115.20 |
| HD | 1,280 | 720 | 2700.00 | 270.00 | 180.00 | 135.00 |
| WXGA | 1,280 | 800 | 3000.00 | 300.00 | 200.00 | 150.00 |
| SXGA | 1,280 | 1,024 | 3840.00 | 384.00 | 256.00 | 192.00 |
| UXGA | 1,600 | 1,200 | 5625.00 | 562.50 | 375.00 | 281.25 |
| Full HD (1080p) | 1,920 | 1,080 | 6075.00 | 607.50 | 405.00 | 303.75 |
| WUXGA | 1,920 | 1,200 | 6750.00 | 675.00 | 450.00 | 337.50 |
| QHD (1440p) | 2,560 | 1,440 | 10800.00 | 1080.00 | 720.00 | 540.00 |
| 4K UHD | 3,840 | 2,160 | 24300.00 | 2430.00 | 1620.00 | 1215.00 |
| 5K | 5,120 | 2,880 | 43200.00 | 4320.00 | 2880.00 | 2160.00 |
| 8K UHD | 7,680 | 4,320 | 97200.00 | 9720.00 | 6480.00 | 4860.00 |

</details>

## FAQ H265 encoding

<details>
<summary>Q1: I have two input frames inputI.hevc, which is full Intra-coded frame and a second inputP.hevc which is Predicted frame. How do I decode the second frame to png?</summary>

### Decoding a P-frame Using an I-frame in HEVC

To decode a P-frame using the corresponding I-frame, you need to combine both frames into a single stream since a P-frame requires the reference frame (I-frame) to be properly decoded. Here’s how you can do it using FFmpeg:

#### Step 1: Combine the I-frame and P-frame into a Single Stream

1. **Concatenate the frames**: Create a text file with the paths to the input HEVC files in the order they should be decoded.

Create a file `inputs.txt` with the following content:
```plaintext
file 'inputI.hevc'
file 'inputP.hevc'
```

2. **Concatenate the HEVC files using FFmpeg**:
```sh
ffmpeg -f concat -safe 0 -i inputs.txt -c copy combined.hevc
```

#### Step 2: Extract and Decode the Second Frame

1. **Decode the combined HEVC stream and extract the second frame**:
```sh
ffmpeg -i combined.hevc -vf "select=eq(n\,1)" -vsync vfr second_frame.png
```

#### Explanation

- The `-f concat -safe 0 -i inputs.txt -c copy combined.hevc` command concatenates the I-frame and P-frame into a single HEVC file.
- The `-vf "select=eq(n\,1)" -vsync vfr second_frame.png` command extracts and decodes the second frame (P-frame) from the combined stream.

#### Complete Command Sequence

1. **Create `inputs.txt`**:
```plaintext
file 'inputI.hevc'
file 'inputP.hevc'
```

2. **Combine the HEVC files**:
```sh
ffmpeg -f concat -safe 0 -i inputs.txt -c copy combined.hevc
```

3. **Extract and decode the second frame**:
```sh
ffmpeg -i combined.hevc -vf "select=eq(n\,1)" -vsync vfr second_frame.png
```

By following these steps, you should be able to decode the second frame (P-frame) using the initial I-frame and convert it to PNG.

</details>
38 changes: 38 additions & 0 deletions host/class/uvc/usb_host_uvc/docs/arch_notes.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,38 @@
## Architectural Notes

This driver utilizes two distinct types of memory buffers, both of which can be configured per UVC stream during stream initialization via the `uvc_host_stream_config_t.advanced` structure. The buffer types are as follows:

### 1. URB (USB Request Block)
- **Definition:** Provided by the USB Host Library in ESP-IDF.
- **Ownership:** Managed by the USB Host Library.
- **Placement:** Determined by ESP-IDF settings (internal or external RAM).
- **Purpose:** Transfers raw data from the USB device to the ESP host.
- **Behavior:**
- The driver continuously resubmits the URB until streaming stops.
- After submission, the CPU is interrupted when the URB is completed.
- The buffer size impacts interrupt frequency—larger buffers result in fewer interrupts.
- **Recommendation:** Use triple buffering for optimal performance:
- One buffer is processed by the driver.
- One buffer is actively transferring data.
- One buffer is queued for submission.
- **Driver Role:** Processes data from the URB and reconstructs video frames.

### 2. FB (Frame Buffer)
- **Definition:** Custom buffer type defined within this driver.
- **Ownership:** Dynamic ownership:
- Empty FBs are owned by the driver.
- Once a full frame is reconstructed and stored in the FB, it is passed to the user via `uvc_host_frame_callback_t`.
- After processing the frame, the user must return the FB to the driver, either by:
- Returning `true` from the callback, or
- Explicitly calling `uvc_host_frame_return()`.
- **Recommendation:** Use triple buffering for optimal performance:
- One buffer is used for frame reconstruction.
- One buffer is processed by the user.
- One buffer is queued for use.
- **Frame Size Considerations:**
- UVC cameras report the maximum frame buffer size during stream format negotiation.
- These sizes are often overly large, leading to inefficient RAM usage.
- This driver allows the allocation of smaller FBs to optimize memory usage.

### Frame buffer state transitions
![Frame buffer state transitions](./uvc_frames_state_transitions.png)
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Loading