From patchwork Mon Jul 22 18:44:24 2024
Content-Type: text/plain; charset="utf-8"
MIME-Version: 1.0
Content-Transfer-Encoding: 7bit
X-Patchwork-Submitter: =?utf-8?q?R=C3=A9mi_Denis-Courmont?=
X-Patchwork-Id: 50684
From: =?utf-8?q?R=C3=A9mi_Denis-Courmont?=
To: ffmpeg-devel@ffmpeg.org
Date: Mon, 22 Jul 2024 21:44:24 +0300
Message-ID: <20240722184431.40853-5-remi@remlab.net>
X-Mailer: git-send-email 2.45.2
In-Reply-To: <20240722184431.40853-1-remi@remlab.net>
References: <20240722184431.40853-1-remi@remlab.net>
Subject: [FFmpeg-devel] [PATCH 5/9] sws/riscv: require B or zba explicitly

---
 libswscale/riscv/input_rvv.S   | 12 ++++++------
 libswscale/riscv/range_rvv.S   |  8 ++++----
 libswscale/riscv/rgb2rgb_rvb.S |  2 +-
 libswscale/riscv/rgb2rgb_rvv.S | 12 ++++++------
 4 files changed, 17 insertions(+), 17 deletions(-)

diff --git a/libswscale/riscv/input_rvv.S b/libswscale/riscv/input_rvv.S
index 1d7de59c66..d07db43b55 100644
--- a/libswscale/riscv/input_rvv.S
+++ b/libswscale/riscv/input_rvv.S
@@ -26,7 +26,7 @@ func ff_bgr24ToY_rvv, zve32x
         j 1f
 endfunc
 
-func ff_rgb24ToY_rvv, zve32x
+func ff_rgb24ToY_rvv, zve32x, zba
         lw t1, 0(a5) # RY
         lw t3, 8(a5) # BY
 1:
@@ -62,7 +62,7 @@ func ff_bgr24ToUV_rvv, zve32x
         j 1f
 endfunc
 
-func ff_rgb24ToUV_rvv, zve32x
+func ff_rgb24ToUV_rvv, zve32x, zba
         lw t1, 12(a6) # RU
         lw t4, 24(a6) # RV
         lw t3, 20(a6) # BU
@@ -108,7 +108,7 @@ func ff_bgr24ToUV_half_rvv, zve32x
         j 1f
 endfunc
 
-func ff_rgb24ToUV_half_rvv, zve32x
+func ff_rgb24ToUV_half_rvv, zve32x, zba
         lw t1, 12(a6) # RU
         lw t4, 24(a6) # RV
         lw t3, 20(a6) # BU
@@ -157,7 +157,7 @@ func ff_\chr1\()ToY_rvv, zve32x
         j 1f
 endfunc
 
-func ff_\chr0\()ToY_rvv, zve32x
+func ff_\chr0\()ToY_rvv, zve32x, zba
         lw t1, 0(a5) # RY
         lw t3, 8(a5) # BY
 1:
@@ -199,7 +199,7 @@ func ff_\chr1\()ToUV_rvv, zve32x
         j 1f
 endfunc
 
-func ff_\chr0\()ToUV_rvv, zve32x
+func ff_\chr0\()ToUV_rvv, zve32x, zba
         lw t1, 12(a6) # RU
         lw t4, 24(a6) # RV
         lw t3, 20(a6) # BU
@@ -251,7 +251,7 @@ func ff_\chr1\()ToUV_half_rvv, zve32x
         j 1f
 endfunc
 
-func ff_\chr0\()ToUV_half_rvv, zve32x
+func ff_\chr0\()ToUV_half_rvv, zve32x, zba
         lw t1, 12(a6) # RU
         lw t4, 24(a6) # RV
         lw t3, 20(a6) # BU
diff --git a/libswscale/riscv/range_rvv.S b/libswscale/riscv/range_rvv.S
index 9da80e6199..19a74eba79 100644
--- a/libswscale/riscv/range_rvv.S
+++ b/libswscale/riscv/range_rvv.S
@@ -20,7 +20,7 @@
 
 #include "libavutil/riscv/asm.S"
 
-func ff_range_lum_to_jpeg_16_rvv, zve32x
+func ff_range_lum_to_jpeg_16_rvv, zve32x, zba
         li t1, 30189
         li t2, 19077
         li t3, -39057361
@@ -41,7 +41,7 @@ func ff_range_lum_to_jpeg_16_rvv, zve32x
         ret
 endfunc
 
-func ff_range_lum_from_jpeg_16_rvv, zve32x
+func ff_range_lum_from_jpeg_16_rvv, zve32x, zba
         li t1, 14071
         li t2, 33561947
 1:
@@ -60,7 +60,7 @@ func ff_range_lum_from_jpeg_16_rvv, zve32x
         ret
 endfunc
 
-func ff_range_chr_to_jpeg_16_rvv, zve32x
+func ff_range_chr_to_jpeg_16_rvv, zve32x, zba
         li t1, 30775
         li t2, 4663
         li t3, -9289992
@@ -88,7 +88,7 @@ func ff_range_chr_to_jpeg_16_rvv, zve32x
         ret
 endfunc
 
-func ff_range_chr_from_jpeg_16_rvv, zve32x
+func ff_range_chr_from_jpeg_16_rvv, zve32x, zba
         li t1, 1799
         li t2, 4081085
 1:
diff --git a/libswscale/riscv/rgb2rgb_rvb.S b/libswscale/riscv/rgb2rgb_rvb.S
index af127b32ed..d18e5ba01b 100644
--- a/libswscale/riscv/rgb2rgb_rvb.S
+++ b/libswscale/riscv/rgb2rgb_rvb.S
@@ -23,7 +23,7 @@
 #include "libavutil/riscv/bswap_rvb.S"
 
 #if (__riscv_xlen >= 64)
-func ff_shuffle_bytes_3210_rvb, zbb
+func ff_shuffle_bytes_3210_rvb, zba, zbb
         srli a2, a2, 2
         bswap32_rvb a1, a0, a2
 endfunc
diff --git a/libswscale/riscv/rgb2rgb_rvv.S b/libswscale/riscv/rgb2rgb_rvv.S
index 19f7aaf67d..e1270ac0df 100644
--- a/libswscale/riscv/rgb2rgb_rvv.S
+++ b/libswscale/riscv/rgb2rgb_rvv.S
@@ -25,7 +25,7 @@ func ff_shuffle_bytes_0321_rvv, zve32x
         j 1f
 endfunc
 
-func ff_shuffle_bytes_2103_rvv, zve32x
+func ff_shuffle_bytes_2103_rvv, zve32x, zba
         li t1, ~0x00ff00ff
 1:
         not t2, t1
@@ -54,7 +54,7 @@ func ff_shuffle_bytes_1230_rvv, zve32x
         j 3f
 endfunc
 
-func ff_shuffle_bytes_3012_rvv, zve32x
+func ff_shuffle_bytes_3012_rvv, zve32x, zba
         li t1, 8
         li t2, 24
 3:
@@ -74,7 +74,7 @@ func ff_shuffle_bytes_3012_rvv, zve32x
         ret
 endfunc
 
-func ff_interleave_bytes_rvv, zve32x
+func ff_interleave_bytes_rvv, zve32x, zba
 1:
         mv t0, a0
         mv t1, a1
@@ -100,7 +100,7 @@ func ff_interleave_bytes_rvv, zve32x
         ret
 endfunc
 
-func ff_deinterleave_bytes_rvv, zve32x
+func ff_deinterleave_bytes_rvv, zve32x, zba
 1:
         mv t0, a0
         mv t1, a1
@@ -165,10 +165,10 @@ endfunc
         ret
 .endm
 
-func ff_uyvytoyuv422_rvv, zve32x, zbb
+func ff_uyvytoyuv422_rvv, zve32x, b
         yuy2_to_i422p v20, v16
 endfunc
 
-func ff_yuyvtoyuv422_rvv, zve32x, zbb
+func ff_yuyvtoyuv422_rvv, zve32x, b
         yuy2_to_i422p v16, v20
 endfunc