From patchwork Wed Apr 17 02:08:49 2019
X-Patchwork-Submitter: Jarek Samic
X-Patchwork-Id: 12774
From: Jarek Samic
To: ffmpeg-devel@ffmpeg.org
Cc: Jarek Samic
Date: Tue, 16 Apr 2019 22:08:49 -0400
Message-Id: <20190417020849.21220-1-cldfire3@gmail.com>
X-Mailer: git-send-email 2.21.0
Subject: [FFmpeg-devel] [PATCH v3] lavfi: add colorkey_opencl filter

This is a direct port of the CPU filter.

Signed-off-by: Jarek Samic
---
More fixes based on the comments from the second version of the patch
(moving sampler declaration into the program scope, `f`-suffixing
constants, attaching the `*` sigil to the variable name rather than the
data type).

 configure                        |   1 +
 doc/filters.texi                 |  33 +++++
 libavfilter/Makefile             |   2 +
 libavfilter/allfilters.c         |   1 +
 libavfilter/opencl/colorkey.cl   |  49 +++++++
 libavfilter/opencl_source.h      |   1 +
 libavfilter/vf_colorkey_opencl.c | 244 +++++++++++++++++++++++++++++++
 7 files changed, 331 insertions(+)
 create mode 100644 libavfilter/opencl/colorkey.cl
 create mode 100644 libavfilter/vf_colorkey_opencl.c

diff --git a/configure b/configure
index e10e2c2c46..ac59c4ddec 100755
--- a/configure
+++ b/configure
@@ -3413,6 +3413,7 @@ boxblur_filter_deps="gpl"
 boxblur_opencl_filter_deps="opencl gpl"
 bs2b_filter_deps="libbs2b"
 colormatrix_filter_deps="gpl"
+colorkey_opencl_filter_deps="opencl"
 convolution_opencl_filter_deps="opencl"
 convolve_filter_deps="avcodec"
 convolve_filter_select="fft"
diff --git a/doc/filters.texi b/doc/filters.texi
index 867607d870..390c8b97cf 100644
--- a/doc/filters.texi
+++ b/doc/filters.texi
@@ -19030,6 +19030,39 @@ Apply erosion filter with threshold0 set to 30, threshold1 set 40, threshold2 se
 @end example
 @end itemize
 
+@section colorkey_opencl
+RGB colorspace color keying.
+
+The filter accepts the following options:
+
+@table @option
+@item color
+The color which will be replaced with transparency.
+
+@item similarity
+Similarity percentage with the key color.
+
+0.01 matches only the exact key color, while 1.0 matches everything.
+
+@item blend
+Blend percentage.
+
+0.0 makes pixels either fully transparent, or not transparent at all.
+
+Higher values result in semi-transparent pixels, with a higher transparency
+the more similar the pixel's color is to the key color.
+@end table
+
+@subsection Examples
+
+@itemize
+@item
+Make every semi-green pixel in the input transparent with some slight blending:
+@example
+-i INPUT -vf "hwupload, colorkey_opencl=green:0.3:0.1, hwdownload" OUTPUT
+@end example
+@end itemize
+
 @section overlay_opencl
 
 Overlay one video on top of another.
diff --git a/libavfilter/Makefile b/libavfilter/Makefile
index fef6ec5c55..9589dd8747 100644
--- a/libavfilter/Makefile
+++ b/libavfilter/Makefile
@@ -176,6 +176,8 @@ OBJS-$(CONFIG_CODECVIEW_FILTER)              += vf_codecview.o
 OBJS-$(CONFIG_COLORBALANCE_FILTER)           += vf_colorbalance.o
 OBJS-$(CONFIG_COLORCHANNELMIXER_FILTER)      += vf_colorchannelmixer.o
 OBJS-$(CONFIG_COLORKEY_FILTER)               += vf_colorkey.o
+OBJS-$(CONFIG_COLORKEY_OPENCL_FILTER)        += vf_colorkey_opencl.o opencl.o \
+                                                opencl/colorkey.o
 OBJS-$(CONFIG_COLORLEVELS_FILTER)            += vf_colorlevels.o
 OBJS-$(CONFIG_COLORMATRIX_FILTER)            += vf_colormatrix.o
 OBJS-$(CONFIG_COLORSPACE_FILTER)             += vf_colorspace.o colorspace.o colorspacedsp.o
diff --git a/libavfilter/allfilters.c b/libavfilter/allfilters.c
index c51ae0f3c7..ff4eb5bf6b 100644
--- a/libavfilter/allfilters.c
+++ b/libavfilter/allfilters.c
@@ -165,6 +165,7 @@ extern AVFilter ff_vf_codecview;
 extern AVFilter ff_vf_colorbalance;
 extern AVFilter ff_vf_colorchannelmixer;
 extern AVFilter ff_vf_colorkey;
+extern AVFilter ff_vf_colorkey_opencl;
 extern AVFilter ff_vf_colorlevels;
 extern AVFilter ff_vf_colormatrix;
 extern AVFilter ff_vf_colorspace;
diff --git a/libavfilter/opencl/colorkey.cl b/libavfilter/opencl/colorkey.cl
new file mode 100644
index 0000000000..6d71f17164
--- /dev/null
+++ b/libavfilter/opencl/colorkey.cl
@@ -0,0 +1,49 @@
+/*
+ * This file is part of FFmpeg.
+ *
+ * FFmpeg is free software; you can redistribute it and/or
+ * modify it under the terms of the GNU Lesser General Public
+ * License as published by the Free Software Foundation; either
+ * version 2.1 of the License, or (at your option) any later version.
+ *
+ * FFmpeg is distributed in the hope that it will be useful,
+ * but WITHOUT ANY WARRANTY; without even the implied warranty of
+ * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU
+ * Lesser General Public License for more details.
+ *
+ * You should have received a copy of the GNU Lesser General Public
+ * License along with FFmpeg; if not, write to the Free Software
+ * Foundation, Inc., 51 Franklin Street, Fifth Floor, Boston, MA 02110-1301 USA
+ */
+
+const sampler_t sampler = CLK_NORMALIZED_COORDS_FALSE |
+                          CLK_FILTER_NEAREST;
+
+__kernel void colorkey_blend(
+    __read_only  image2d_t src,
+    __write_only image2d_t dst,
+    float4 colorkey_rgba,
+    float similarity,
+    float blend
+) {
+    int2 loc = (int2)(get_global_id(0), get_global_id(1));
+    float4 pixel = read_imagef(src, sampler, loc);
+    float diff = distance(pixel.xyz, colorkey_rgba.xyz);
+
+    pixel.s3 = clamp((diff - similarity) / blend, 0.0f, 1.0f);
+    write_imagef(dst, loc, pixel);
+}
+
+__kernel void colorkey(
+    __read_only  image2d_t src,
+    __write_only image2d_t dst,
+    float4 colorkey_rgba,
+    float similarity
+) {
+    int2 loc = (int2)(get_global_id(0), get_global_id(1));
+    float4 pixel = read_imagef(src, sampler, loc);
+    float diff = distance(pixel.xyz, colorkey_rgba.xyz);
+
+    pixel.s3 = (diff > similarity) ? 1.0f : 0.0f;
+    write_imagef(dst, loc, pixel);
+}
diff --git a/libavfilter/opencl_source.h b/libavfilter/opencl_source.h
index 4118138c30..51f7178cf2 100644
--- a/libavfilter/opencl_source.h
+++ b/libavfilter/opencl_source.h
@@ -20,6 +20,7 @@
 #define AVFILTER_OPENCL_SOURCE_H
 
 extern const char *ff_opencl_source_avgblur;
+extern const char *ff_opencl_source_colorkey;
 extern const char *ff_opencl_source_colorspace_common;
 extern const char *ff_opencl_source_convolution;
 extern const char *ff_opencl_source_neighbor;
diff --git a/libavfilter/vf_colorkey_opencl.c b/libavfilter/vf_colorkey_opencl.c
new file mode 100644
index 0000000000..46a0454fbd
--- /dev/null
+++ b/libavfilter/vf_colorkey_opencl.c
@@ -0,0 +1,244 @@
+/*
+ * This file is part of FFmpeg.
+ *
+ * FFmpeg is free software; you can redistribute it and/or
+ * modify it under the terms of the GNU Lesser General Public
+ * License as published by the Free Software Foundation; either
+ * version 2.1 of the License, or (at your option) any later version.
+ *
+ * FFmpeg is distributed in the hope that it will be useful,
+ * but WITHOUT ANY WARRANTY; without even the implied warranty of
+ * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU
+ * Lesser General Public License for more details.
+ *
+ * You should have received a copy of the GNU Lesser General Public
+ * License along with FFmpeg; if not, write to the Free Software
+ * Foundation, Inc., 51 Franklin Street, Fifth Floor, Boston, MA 02110-1301 USA
+ */
+
+#include "libavutil/opt.h"
+#include "libavutil/imgutils.h"
+#include "avfilter.h"
+#include "formats.h"
+#include "internal.h"
+#include "opencl.h"
+#include "opencl_source.h"
+#include "video.h"
+
+typedef struct ColorkeyOpenCLContext {
+    OpenCLFilterContext ocf;
+    // Whether or not the above `OpenCLFilterContext` has been initialized
+    int initialized;
+
+    cl_command_queue command_queue;
+    cl_kernel kernel_colorkey;
+
+    // The color we are supposed to replace with transparency
+    uint8_t colorkey_rgba[4];
+    // Stored as a normalized float for passing to the OpenCL kernel
+    cl_float4 colorkey_rgba_float;
+    // Similarity percentage compared to `colorkey_rgba`, ranging from `0.01` to `1.0`
+    // where `0.01` matches only the key color and `1.0` matches all colors
+    float similarity;
+    // Blending percentage where `0.0` results in fully transparent pixels, `1.0` results
+    // in fully opaque pixels, and numbers in between result in transparency that varies
+    // based on the similarity to the key color
+    float blend;
+} ColorkeyOpenCLContext;
+
+static int colorkey_opencl_init(AVFilterContext *avctx)
+{
+    ColorkeyOpenCLContext *ctx = avctx->priv;
+    cl_int cle;
+    int err;
+
+    err = ff_opencl_filter_load_program(avctx, &ff_opencl_source_colorkey, 1);
+    if (err < 0)
+        goto fail;
+
+    ctx->command_queue = clCreateCommandQueue(
+        ctx->ocf.hwctx->context,
+        ctx->ocf.hwctx->device_id,
+        0,
+        &cle
+    );
+
+    CL_FAIL_ON_ERROR(AVERROR(EIO), "Failed to create OpenCL command queue %d.\n", cle);
+
+    if (ctx->blend > 0.0001) {
+        ctx->kernel_colorkey = clCreateKernel(ctx->ocf.program, "colorkey_blend", &cle);
+        CL_FAIL_ON_ERROR(AVERROR(EIO), "Failed to create colorkey_blend kernel: %d.\n", cle);
+    } else {
+        ctx->kernel_colorkey = clCreateKernel(ctx->ocf.program, "colorkey", &cle);
+        CL_FAIL_ON_ERROR(AVERROR(EIO), "Failed to create colorkey kernel: %d.\n", cle);
+    }
+
+    for (int i = 0; i < 4; ++i) {
+        ctx->colorkey_rgba_float.s[i] = (float)ctx->colorkey_rgba[i] / 255.0;
+    }
+
+    ctx->initialized = 1;
+    return 0;
+
+fail:
+    if (ctx->command_queue)
+        clReleaseCommandQueue(ctx->command_queue);
+    if (ctx->kernel_colorkey)
+        clReleaseKernel(ctx->kernel_colorkey);
+    return err;
+}
+
+static int filter_frame(AVFilterLink *link, AVFrame *input_frame)
+{
+    AVFilterContext *avctx = link->dst;
+    AVFilterLink *outlink = avctx->outputs[0];
+    ColorkeyOpenCLContext *colorkey_ctx = avctx->priv;
+    AVFrame *output_frame = NULL;
+    int err;
+    cl_int cle;
+    size_t global_work[2];
+    cl_mem src, dst;
+
+    if (!input_frame->hw_frames_ctx)
+        return AVERROR(EINVAL);
+
+    if (!colorkey_ctx->initialized) {
+        AVHWFramesContext *input_frames_ctx =
+            (AVHWFramesContext*)input_frame->hw_frames_ctx->data;
+        int fmt = input_frames_ctx->sw_format;
+
+        // Make sure the input is a format we support
+        if (fmt != AV_PIX_FMT_ARGB &&
+            fmt != AV_PIX_FMT_RGBA &&
+            fmt != AV_PIX_FMT_ABGR &&
+            fmt != AV_PIX_FMT_BGRA
+        ) {
+            av_log(avctx, AV_LOG_ERROR, "unsupported (non-RGB) format in colorkey_opencl.\n");
+            err = AVERROR(ENOSYS);
+            goto fail;
+        }
+
+        err = colorkey_opencl_init(avctx);
+        if (err < 0)
+            goto fail;
+    }
+
+    // This filter only operates on RGB data and we know that will be on the first plane
+    src = (cl_mem)input_frame->data[0];
+    output_frame = ff_get_video_buffer(outlink, outlink->w, outlink->h);
+    if (!output_frame) {
+        err = AVERROR(ENOMEM);
+        goto fail;
+    }
+    dst = (cl_mem)output_frame->data[0];
+
+    CL_SET_KERNEL_ARG(colorkey_ctx->kernel_colorkey, 0, cl_mem, &src);
+    CL_SET_KERNEL_ARG(colorkey_ctx->kernel_colorkey, 1, cl_mem, &dst);
+    CL_SET_KERNEL_ARG(colorkey_ctx->kernel_colorkey, 2, cl_float4, &colorkey_ctx->colorkey_rgba_float);
+    CL_SET_KERNEL_ARG(colorkey_ctx->kernel_colorkey, 3, float, &colorkey_ctx->similarity);
+    if (colorkey_ctx->blend > 0.0001) {
+        CL_SET_KERNEL_ARG(colorkey_ctx->kernel_colorkey, 4, float, &colorkey_ctx->blend);
+    }
+
+    err = ff_opencl_filter_work_size_from_image(avctx, global_work, input_frame, 0, 0);
+    if (err < 0)
+        goto fail;
+
+    cle = clEnqueueNDRangeKernel(
+        colorkey_ctx->command_queue,
+        colorkey_ctx->kernel_colorkey,
+        2,
+        NULL,
+        global_work,
+        NULL,
+        0,
+        NULL,
+        NULL
+    );
+
+    CL_FAIL_ON_ERROR(AVERROR(EIO), "Failed to enqueue colorkey kernel: %d.\n", cle);
+
+    // Run queued kernel
+    cle = clFinish(colorkey_ctx->command_queue);
+    CL_FAIL_ON_ERROR(AVERROR(EIO), "Failed to finish command queue: %d.\n", cle);
+
+    err = av_frame_copy_props(output_frame, input_frame);
+    if (err < 0)
+        goto fail;
+
+    av_frame_free(&input_frame);
+
+    return ff_filter_frame(outlink, output_frame);
+
+fail:
+    clFinish(colorkey_ctx->command_queue);
+    av_frame_free(&input_frame);
+    av_frame_free(&output_frame);
+    return err;
+}
+
+static av_cold void colorkey_opencl_uninit(AVFilterContext *avctx)
+{
+    ColorkeyOpenCLContext *ctx = avctx->priv;
+    cl_int cle;
+
+    if (ctx->kernel_colorkey) {
+        cle = clReleaseKernel(ctx->kernel_colorkey);
+        if (cle != CL_SUCCESS)
+            av_log(avctx, AV_LOG_ERROR, "Failed to release "
+                   "kernel: %d.\n", cle);
+    }
+
+    if (ctx->command_queue) {
+        cle = clReleaseCommandQueue(ctx->command_queue);
+        if (cle != CL_SUCCESS)
+            av_log(avctx, AV_LOG_ERROR, "Failed to release "
+                   "command queue: %d.\n", cle);
+    }
+
+    ff_opencl_filter_uninit(avctx);
+}
+
+static const AVFilterPad colorkey_opencl_inputs[] = {
+    {
+        .name = "default",
+        .type = AVMEDIA_TYPE_VIDEO,
+        .filter_frame = filter_frame,
+        .config_props = &ff_opencl_filter_config_input,
+    },
+    { NULL }
+};
+
+static const AVFilterPad colorkey_opencl_outputs[] = {
+    {
+        .name = "default",
+        .type = AVMEDIA_TYPE_VIDEO,
+        .config_props = &ff_opencl_filter_config_output,
+    },
+    { NULL }
+};
+
+#define OFFSET(x) offsetof(ColorkeyOpenCLContext, x)
+#define FLAGS AV_OPT_FLAG_FILTERING_PARAM|AV_OPT_FLAG_VIDEO_PARAM
+
+static const AVOption colorkey_opencl_options[] = {
+    { "color", "set the colorkey key color", OFFSET(colorkey_rgba), AV_OPT_TYPE_COLOR, { .str = "black" }, CHAR_MIN, CHAR_MAX, FLAGS },
+    { "similarity", "set the colorkey similarity value", OFFSET(similarity), AV_OPT_TYPE_FLOAT, { .dbl = 0.01 }, 0.01, 1.0, FLAGS },
+    { "blend", "set the colorkey key blend value", OFFSET(blend), AV_OPT_TYPE_FLOAT, { .dbl = 0.0 }, 0.0, 1.0, FLAGS },
+    { NULL }
+};
+
+AVFILTER_DEFINE_CLASS(colorkey_opencl);
+
+AVFilter ff_vf_colorkey_opencl = {
+    .name           = "colorkey_opencl",
+    .description    = NULL_IF_CONFIG_SMALL("Turns a certain color into transparency. Operates on RGB colors."),
+    .priv_size      = sizeof(ColorkeyOpenCLContext),
+    .priv_class     = &colorkey_opencl_class,
+    .init           = &ff_opencl_filter_init,
+    .uninit         = &colorkey_opencl_uninit,
+    .query_formats  = &ff_opencl_filter_query_formats,
+    .inputs         = colorkey_opencl_inputs,
+    .outputs        = colorkey_opencl_outputs,
+    .flags_internal = FF_FILTER_FLAG_HWFRAME_AWARE
+};