From patchwork Sun Sep 1 13:20:19 2019 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Lance Wang X-Patchwork-Id: 14829 Return-Path: X-Original-To: patchwork@ffaux-bg.ffmpeg.org Delivered-To: patchwork@ffaux-bg.ffmpeg.org Received: from ffbox0-bg.mplayerhq.hu (ffbox0-bg.ffmpeg.org [79.124.17.100]) by ffaux.localdomain (Postfix) with ESMTP id 2E79544A6FF for ; Sun, 1 Sep 2019 16:20:41 +0300 (EEST) Received: from [127.0.1.1] (localhost [127.0.0.1]) by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTP id E30DF680508; Sun, 1 Sep 2019 16:20:40 +0300 (EEST) X-Original-To: ffmpeg-devel@ffmpeg.org Delivered-To: ffmpeg-devel@ffmpeg.org Received: from mail-pl1-f193.google.com (mail-pl1-f193.google.com [209.85.214.193]) by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTPS id 6D790680373 for ; Sun, 1 Sep 2019 16:20:33 +0300 (EEST) Received: by mail-pl1-f193.google.com with SMTP id az1so4413411plb.6 for ; Sun, 01 Sep 2019 06:20:33 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=from:to:cc:subject:date:message-id:in-reply-to:references; bh=7kCmqqytV3OBjRruSPyKQy6JK7x7QZsXYxmDESMKc3w=; b=b6nLJ7aVIkuk6PCR1gVQrNl1GQYXA9jXRm+xAvgMPdpEIYhMckRdAaCA/V/0VxlxfZ ABVbIf6cOCRMI7hGTbKy4GqIYyHykOsfKcrVGltEpxz7aWFrt/vTWkQR7QO0WbFWav+h rYP4fEJeP9Vz7/G2uMAYNLanwJUARbzla7j36+B1EEXAVXJ//P9H0QsbR1b3m3nNsQOU ZOopGRzcDlO7lJ/9Nrt9NO3aL7mgPHzxyK72DDMy5a65kEA+0y0LDLg0Vvu8sj++SZmA E+0gPuFu7jylhzTb3sBHQ7WMOJ3wOYUGZCN//oElyHScY9qMfvGe2mQKHamURZI7Ns8I pkRw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:to:cc:subject:date:message-id:in-reply-to :references; bh=7kCmqqytV3OBjRruSPyKQy6JK7x7QZsXYxmDESMKc3w=; b=oKFbT+YnC1+FsrSD1DvyTidK4k7mHNnvTmW2thWLUPhZgbxZt1Ilrr5jfZXnXnSUFE ly1YFLqDaQWub0HXNlzeGJQwkL+lyjtUVx38sdMbHPEWVInw8+RR1R1OMqB8TUVIrGhu mqZS8rQxC7Jtiob4ru9StSIDciQSTBcmOhp/nOtGCn2n21/hg8GQ/VAz3EQYmGVcB7b3 Iiew2ISGEy0HdNudDZge31o5J37FIGjB1+oaUE0M804H4qzz9OZtUkFogSzC03ULAUXs 5xj+KwleJVPycYeE1QBvp6UPCn2+/c0LXwpw4PmFnjdVyemAgkfFNagXMkep4+qongdh CCAQ== X-Gm-Message-State: APjAAAXHsZU/cJWkqXRuK8u8efGxgredciiMpAW60IK+Ld/0OIlamFXR 6EOkf7TN/5Z5DtiDK1+a8HcmuMCy X-Google-Smtp-Source: APXvYqyhMfmSLnmq4yJfzL6MUeapVTSPL36P8pAb6jtdjCgwPZc5sGwfohHZPqCyqnjEImsRCs6rLA== X-Received: by 2002:a17:902:aa8a:: with SMTP id d10mr24207217plr.265.1567344031232; Sun, 01 Sep 2019 06:20:31 -0700 (PDT) Received: from vpn.localdomain ([47.90.99.151]) by smtp.gmail.com with ESMTPSA id b14sm12587039pfo.15.2019.09.01.06.20.29 (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Sun, 01 Sep 2019 06:20:30 -0700 (PDT) From: lance.lmwang@gmail.com To: ffmpeg-devel@ffmpeg.org Date: Sun, 1 Sep 2019 21:20:19 +0800 Message-Id: <20190901132023.28531-1-lance.lmwang@gmail.com> X-Mailer: git-send-email 2.9.5 In-Reply-To: <1567007116-9088-1-git-send-email-lance.lmwang@gmail.com> References: <1567007116-9088-1-git-send-email-lance.lmwang@gmail.com> Subject: [FFmpeg-devel] [PATCH v3 1/5] avcodec/v210enc: add depth parameter for WRITE_PIXELS and CLIP X-BeenThere: ffmpeg-devel@ffmpeg.org X-Mailman-Version: 2.1.20 Precedence: list List-Id: FFmpeg development discussions and patches List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Reply-To: FFmpeg development discussions and patches Cc: Limin Wang MIME-Version: 1.0 Errors-To: ffmpeg-devel-bounces@ffmpeg.org Sender: "ffmpeg-devel" From: Limin Wang Signed-off-by: Limin Wang --- libavcodec/v210enc.c | 83 +++++++++++++++++++++++----------------------------- 1 file changed, 36 insertions(+), 47 deletions(-) diff --git a/libavcodec/v210enc.c b/libavcodec/v210enc.c index b024806..1b840b2 100644 --- a/libavcodec/v210enc.c +++ b/libavcodec/v210enc.c @@ -26,25 +26,14 @@ #include "internal.h" #include "v210enc.h" -#define CLIP(v) av_clip(v, 4, 1019) -#define CLIP8(v) av_clip(v, 1, 254) - -#define WRITE_PIXELS(a, b, c) \ - do { \ - val = CLIP(*a++); \ - val |= (CLIP(*b++) << 10) | \ - (CLIP(*c++) << 20); \ - AV_WL32(dst, val); \ - dst += 4; \ - } while (0) - -#define WRITE_PIXELS8(a, b, c) \ - do { \ - val = (CLIP8(*a++) << 2); \ - val |= (CLIP8(*b++) << 12) | \ - (CLIP8(*c++) << 22); \ - AV_WL32(dst, val); \ - dst += 4; \ +#define CLIP(v, depth) av_clip(v, 1 << (depth-8), ((1 << depth)-(1 << (depth-8))-1)) +#define WRITE_PIXELS(a, b, c, depth) \ + do { \ + val = CLIP(*a++, depth) << (10-depth); \ + val |= (CLIP(*b++, depth) << (20-depth)) | \ + (CLIP(*c++, depth) << (30-depth)); \ + AV_WL32(dst, val); \ + dst += 4; \ } while (0) static void v210_planar_pack_8_c(const uint8_t *y, const uint8_t *u, @@ -56,14 +45,14 @@ static void v210_planar_pack_8_c(const uint8_t *y, const uint8_t *u, /* unroll this to match the assembly */ for (i = 0; i < width - 11; i += 12) { - WRITE_PIXELS8(u, y, v); - WRITE_PIXELS8(y, u, y); - WRITE_PIXELS8(v, y, u); - WRITE_PIXELS8(y, v, y); - WRITE_PIXELS8(u, y, v); - WRITE_PIXELS8(y, u, y); - WRITE_PIXELS8(v, y, u); - WRITE_PIXELS8(y, v, y); + WRITE_PIXELS(u, y, v, 8); + WRITE_PIXELS(y, u, y, 8); + WRITE_PIXELS(v, y, u, 8); + WRITE_PIXELS(y, v, y, 8); + WRITE_PIXELS(u, y, v, 8); + WRITE_PIXELS(y, u, y, 8); + WRITE_PIXELS(v, y, u, 8); + WRITE_PIXELS(y, v, y, 8); } } @@ -75,10 +64,10 @@ static void v210_planar_pack_10_c(const uint16_t *y, const uint16_t *u, int i; for (i = 0; i < width - 5; i += 6) { - WRITE_PIXELS(u, y, v); - WRITE_PIXELS(y, u, y); - WRITE_PIXELS(v, y, u); - WRITE_PIXELS(y, v, y); + WRITE_PIXELS(u, y, v, 10); + WRITE_PIXELS(y, u, y, 10); + WRITE_PIXELS(v, y, u, 10); + WRITE_PIXELS(y, v, y, 10); } } @@ -153,26 +142,26 @@ static int encode_frame(AVCodecContext *avctx, AVPacket *pkt, dst += sample_w * 16 * s->sample_factor_10; for (; w < avctx->width - 5; w += 6) { - WRITE_PIXELS(u, y, v); - WRITE_PIXELS(y, u, y); - WRITE_PIXELS(v, y, u); - WRITE_PIXELS(y, v, y); + WRITE_PIXELS(u, y, v, 10); + WRITE_PIXELS(y, u, y, 10); + WRITE_PIXELS(v, y, u, 10); + WRITE_PIXELS(y, v, y, 10); } if (w < avctx->width - 1) { - WRITE_PIXELS(u, y, v); + WRITE_PIXELS(u, y, v, 10); - val = CLIP(*y++); + val = CLIP(*y++, 10); if (w == avctx->width - 2) { AV_WL32(dst, val); dst += 4; } } if (w < avctx->width - 3) { - val |= (CLIP(*u++) << 10) | (CLIP(*y++) << 20); + val |= (CLIP(*u++, 10) << 10) | (CLIP(*y++, 10) << 20); AV_WL32(dst, val); dst += 4; - val = CLIP(*v++) | (CLIP(*y++) << 10); + val = CLIP(*v++, 10) | (CLIP(*y++, 10) << 10); AV_WL32(dst, val); dst += 4; } @@ -202,26 +191,26 @@ static int encode_frame(AVCodecContext *avctx, AVPacket *pkt, dst += sample_w * 32 * s->sample_factor_8; for (; w < avctx->width - 5; w += 6) { - WRITE_PIXELS8(u, y, v); - WRITE_PIXELS8(y, u, y); - WRITE_PIXELS8(v, y, u); - WRITE_PIXELS8(y, v, y); + WRITE_PIXELS(u, y, v, 8); + WRITE_PIXELS(y, u, y, 8); + WRITE_PIXELS(v, y, u, 8); + WRITE_PIXELS(y, v, y, 8); } if (w < avctx->width - 1) { - WRITE_PIXELS8(u, y, v); + WRITE_PIXELS(u, y, v, 8); - val = CLIP8(*y++) << 2; + val = CLIP(*y++, 8) << 2; if (w == avctx->width - 2) { AV_WL32(dst, val); dst += 4; } } if (w < avctx->width - 3) { - val |= (CLIP8(*u++) << 12) | (CLIP8(*y++) << 22); + val |= (CLIP(*u++, 8) << 12) | (CLIP(*y++, 8) << 22); AV_WL32(dst, val); dst += 4; - val = (CLIP8(*v++) << 2) | (CLIP8(*y++) << 12); + val = (CLIP(*v++, 8) << 2) | (CLIP(*y++, 8) << 12); AV_WL32(dst, val); dst += 4; }