From patchwork Sun Sep 1 13:20:20 2019
X-Patchwork-Submitter: Lance Wang
X-Patchwork-Id: 14830
From: lance.lmwang@gmail.com
To: ffmpeg-devel@ffmpeg.org
Cc: Limin Wang
Date: Sun, 1 Sep 2019 21:20:20 +0800
Message-Id: <20190901132023.28531-2-lance.lmwang@gmail.com>
In-Reply-To: <20190901132023.28531-1-lance.lmwang@gmail.com>
References: <1567007116-9088-1-git-send-email-lance.lmwang@gmail.com> <20190901132023.28531-1-lance.lmwang@gmail.com>
Subject: [FFmpeg-devel] [PATCH v3 2/5] avcodec/v210enc: make 8bit and 10bit function consistent

From: Limin Wang

I have benchmarked the C code and haven't seen any performance impact.

Signed-off-by: Limin Wang
---
 libavcodec/v210enc.c | 7 +------
 1 file changed, 1 insertion(+), 6 deletions(-)

diff --git a/libavcodec/v210enc.c b/libavcodec/v210enc.c
index 1b840b2..69a2efe 100644
--- a/libavcodec/v210enc.c
+++ b/libavcodec/v210enc.c
@@ -43,12 +43,7 @@ static void v210_planar_pack_8_c(const uint8_t *y, const uint8_t *u,
     uint32_t val;
     int i;
 
-    /* unroll this to match the assembly */
-    for (i = 0; i < width - 11; i += 12) {
-        WRITE_PIXELS(u, y, v, 8);
-        WRITE_PIXELS(y, u, y, 8);
-        WRITE_PIXELS(v, y, u, 8);
-        WRITE_PIXELS(y, v, y, 8);
+    for (i = 0; i < width - 5; i += 6) {
         WRITE_PIXELS(u, y, v, 8);
         WRITE_PIXELS(y, u, y, 8);
         WRITE_PIXELS(v, y, u, 8);
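
For readers who want to see why dropping the unroll should not change the output, here is a rough standalone sketch of the two loop shapes. It is not the FFmpeg code: PACK3_8, put_le32, pack_step6, pack_step12 and packers_agree are hypothetical names made up for this illustration, and PACK3_8 only approximates what WRITE_PIXELS(a, b, c, 8) is assumed to do (widen three 8-bit samples to 10 bits and store them in one little-endian 32-bit word). Any clamping of sample values that the real macro may perform is left out.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* Store a 32-bit value in little-endian byte order. */
static void put_le32(uint8_t *dst, uint32_t val)
{
    dst[0] = (uint8_t)(val);
    dst[1] = (uint8_t)(val >> 8);
    dst[2] = (uint8_t)(val >> 16);
    dst[3] = (uint8_t)(val >> 24);
}

/* Hypothetical stand-in for WRITE_PIXELS(a, b, c, 8).
 * Each sample is read in its own statement because the same
 * pointer can appear twice in one invocation, e.g. (y, u, y). */
#define PACK3_8(a, b, c)                        \
    do {                                        \
        uint32_t val;                           \
        val  = (uint32_t)*(a)++ << 2;           \
        val |= (uint32_t)*(b)++ << 12;          \
        val |= (uint32_t)*(c)++ << 22;          \
        put_le32(dst, val);                     \
        dst += 4;                               \
    } while (0)

/* Loop shape after the patch: 6 pixels (16 bytes) per iteration.
 * Returns the number of pixels consumed. */
static int pack_step6(const uint8_t *y, const uint8_t *u, const uint8_t *v,
                      uint8_t *dst, int width)
{
    int i;
    for (i = 0; i < width - 5; i += 6) {
        PACK3_8(u, y, v);
        PACK3_8(y, u, y);
        PACK3_8(v, y, u);
        PACK3_8(y, v, y);
    }
    return i;
}

/* Loop shape before the patch: the same body written out twice,
 * 12 pixels (32 bytes) per iteration. */
static int pack_step12(const uint8_t *y, const uint8_t *u, const uint8_t *v,
                       uint8_t *dst, int width)
{
    int i;
    for (i = 0; i < width - 11; i += 12) {
        PACK3_8(u, y, v);
        PACK3_8(y, u, y);
        PACK3_8(v, y, u);
        PACK3_8(y, v, y);
        PACK3_8(u, y, v);
        PACK3_8(y, u, y);
        PACK3_8(v, y, u);
        PACK3_8(y, v, y);
    }
    return i;
}

/* Returns 1 when both loops consume the same number of pixels
 * and write identical bytes for a synthetic line of `width` pixels. */
static int packers_agree(int width)
{
    uint8_t y[48], u[24], v[24], a[128], b[128];
    int i, na, nb;

    assert(width >= 0 && width <= 48);
    for (i = 0; i < 48; i++)
        y[i] = (uint8_t)(16 + 3 * i);
    for (i = 0; i < 24; i++) {
        u[i] = (uint8_t)(64 + i);
        v[i] = (uint8_t)(200 - i);
    }
    memset(a, 0, sizeof(a));
    memset(b, 0, sizeof(b));

    na = pack_step6(y, u, v, a, width);
    nb = pack_step12(y, u, v, b, width);
    return na == nb && memcmp(a, b, sizeof(a)) == 0;
}
```

In this sketch packers_agree(12) and packers_agree(24) return 1, while packers_agree(18) returns 0, because the 12-pixel loop stops after the first 12 pixels and leaves the last group of 6 unpacked. So the two shapes only match byte for byte when the width handed to them is a multiple of 12; for other widths the 6-pixel loop simply covers more of the line.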