From patchwork Tue Jul 18 18:52:51 2023
X-Patchwork-Id: 42805
From: Rémi Denis-Courmont
To: ffmpeg-devel@ffmpeg.org
Date: Tue, 18 Jul 2023 21:52:51 +0300
Message-Id: <20230718185253.18927-1-remi@remlab.net>
Subject: [FFmpeg-devel] [PATCH 1/3] swscale: inline ff_shuffle_bytes_3210_rvv

No functional changes.
---
 libswscale/riscv/rgb2rgb_rvv.S | 19 ++++++++++++++++++-
 1 file changed, 18 insertions(+), 1 deletion(-)

diff --git a/libswscale/riscv/rgb2rgb_rvv.S b/libswscale/riscv/rgb2rgb_rvv.S
index bbdfdbebbc..8f30d760cb 100644
--- a/libswscale/riscv/rgb2rgb_rvv.S
+++ b/libswscale/riscv/rgb2rgb_rvv.S
@@ -74,7 +74,24 @@ func ff_shuffle_bytes_3210_rvv, zve32x
         addi    t2, a0, 1
         addi    t3, a0, 0
         addi    a0, a0, 3
-        j       1b
+        srai    a2, a2, 2
+        li      t4, 4
+1:
+        vsetvli t0, a2, e8, m1, ta, ma
+        sub     a2, a2, t0
+        vlse8.v v8, (a0), t4
+        sh2add  a0, t0, a0
+        vlse8.v v9, (t1), t4
+        sh2add  t1, t0, t1
+        vlse8.v v10, (t2), t4
+        sh2add  t2, t0, t2
+        vlse8.v v11, (t3), t4
+        sh2add  t3, t0, t3
+        vsseg4e8.v v8, (a1)
+        sh2add  a1, t0, a1
+        bnez    a2, 1b
+
+        ret
 endfunc
 
 func ff_interleave_bytes_rvv, zve32x
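
Reviewer's note (not part of the patch): the scalar C sketch below models what the inlined loop appears to compute, as an aid for checking the "no functional changes" claim. It rests on assumptions the hunk does not show: that a0, a1 and a2 hold the source pointer, destination pointer and byte count (the usual RISC-V argument registers), and that t1 is set to a0 + 2 just above the visible context. The name shuffle_bytes_3210_ref is hypothetical and does not exist in FFmpeg.

```c
#include <stdint.h>

/*
 * Scalar model of the vector loop (an assumption-based sketch, not FFmpeg code).
 * The four strided loads read src+3, src+2, src+1 and src+0 with a stride of
 * 4 bytes, and the 4-field segment store interleaves them, so each 4-byte
 * group is written out in reversed byte order.
 */
static void shuffle_bytes_3210_ref(const uint8_t *src, uint8_t *dst, int src_size)
{
    int groups = src_size >> 2;          /* srai a2, a2, 2: count of 4-byte groups */

    for (int i = 0; i < groups; i++) {
        dst[4 * i + 0] = src[4 * i + 3]; /* v8:  loaded from a0 = src + 3 */
        dst[4 * i + 1] = src[4 * i + 2]; /* v9:  loaded from t1 (assumed src + 2) */
        dst[4 * i + 2] = src[4 * i + 1]; /* v10: loaded from t2 = src + 1 */
        dst[4 * i + 3] = src[4 * i + 0]; /* v11: loaded from t3 = src + 0 */
    }
}
```

As in the assembly, any trailing bytes beyond a multiple of four are left untouched, because the count is shifted right by two before the loop starts.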