From patchwork Thu Jul 13 20:19:57 2023 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: =?utf-8?q?R=C3=A9mi_Denis-Courmont?= X-Patchwork-Id: 42667 Delivered-To: ffmpegpatchwork2@gmail.com Received: by 2002:a05:6a20:6da1:b0:131:a7d0:bc6d with SMTP id gl33csp3182881pzb; Thu, 13 Jul 2023 13:20:08 -0700 (PDT) X-Google-Smtp-Source: APBJJlFNgSlGKbiFijZHg/RdzMIxWR6dKUJzZQTdTXLDnyuGCPd6Z0Fa7SsKOPNUXc678sl6IAIn X-Received: by 2002:a17:906:af65:b0:98d:e605:2bce with SMTP id os5-20020a170906af6500b0098de6052bcemr2237618ejb.46.1689279608221; Thu, 13 Jul 2023 13:20:08 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1689279608; cv=none; d=google.com; s=arc-20160816; b=UhHPGsFi1QKLjVqMPOd/bp493eISn7+bopLkDabNPDfsgmfBx7BimRTPTgein54pfd eSpbCKiCu9qKjVb7U+tONptpHnY/ml27uE7gextzPRxS13wUnPyBNlHULomtFoFkkHQP CMWMUX2Q9jis01Wh+w9W4WxE0vZgDIYWOk6vUhTW6EfEgmD4i1W2VjdZU2QeU5qTpH/2 LlN8VZrvhzQ224SEH10Ogkq2Uaxaqo9SYH51s0Xv0b3lVj9pOQk6O2kQtpNDMQNV6/vX BGix0QJd3wLb7Zc6F+AcqkMEZYgKCIR+Tp5rcZZionkZxZrA/lwWsTjGc81bAM1Cxqwc 2XYg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=sender:errors-to:content-transfer-encoding:reply-to:list-subscribe :list-help:list-post:list-archive:list-unsubscribe:list-id :precedence:subject:mime-version:message-id:date:to:from :delivered-to; bh=d40/YxDqiXdP4P7e/5jo5AJheICnduXPJGtyTTL17z0=; fh=hQcp50obTJ8bXC1it5NuEN23RGKfx0/zZ3s2gmreL+A=; b=fixgHjxeAhlHbvDWQgGKmCovMAVnaqsLxK4fTFLVlsxiCoE9D4Bx+MCGcSkLNYN2JB BF7N/u4YLeXV+BoPVwjt4+ZUT8KnGy55zVjSi0ELuKqXizkkZTuVJLj//T395ZtYguzZ SJlyzU8T5VkSXHXBUK5rTgJf8QGLfeOVl+92FR0G8hNLx54RqM9gCST8+UR/+mdhxV9q 0xt+Fkgv4DRFeLEA2+jVuErj/UkHnun/dRjXA4asvCHhADm8FKheKGYKnDVSFYfXbhNJ gOfahS4+99upLchS7ryyquXYmDSdsonZYR0sT+GVCTrUyKGXtfCivfT66VoNrNVC5ycE 33MA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of ffmpeg-devel-bounces@ffmpeg.org designates 79.124.17.100 as permitted sender) smtp.mailfrom=ffmpeg-devel-bounces@ffmpeg.org Return-Path: Received: from ffbox0-bg.mplayerhq.hu (ffbox0-bg.ffmpeg.org. [79.124.17.100]) by mx.google.com with ESMTP id n25-20020a17090695d900b009939cd92a18si9606342ejy.73.2023.07.13.13.20.07; Thu, 13 Jul 2023 13:20:08 -0700 (PDT) Received-SPF: pass (google.com: domain of ffmpeg-devel-bounces@ffmpeg.org designates 79.124.17.100 as permitted sender) client-ip=79.124.17.100; Authentication-Results: mx.google.com; spf=pass (google.com: domain of ffmpeg-devel-bounces@ffmpeg.org designates 79.124.17.100 as permitted sender) smtp.mailfrom=ffmpeg-devel-bounces@ffmpeg.org Received: from [127.0.1.1] (localhost [127.0.0.1]) by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTP id 8DB1D68C617; Thu, 13 Jul 2023 23:20:04 +0300 (EEST) X-Original-To: ffmpeg-devel@ffmpeg.org Delivered-To: ffmpeg-devel@ffmpeg.org Received: from ursule.remlab.net (vps-a2bccee9.vps.ovh.net [51.75.19.47]) by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTP id 4D74F68C3C4 for ; Thu, 13 Jul 2023 23:19:58 +0300 (EEST) Received: from basile.remlab.net (localhost [IPv6:::1]) by ursule.remlab.net (Postfix) with ESMTP id 83067C01C3 for ; Thu, 13 Jul 2023 23:19:57 +0300 (EEST) From: =?utf-8?q?R=C3=A9mi_Denis-Courmont?= To: ffmpeg-devel@ffmpeg.org Date: Thu, 13 Jul 2023 23:19:57 +0300 Message-Id: <20230713201957.10725-1-remi@remlab.net> X-Mailer: git-send-email 2.40.1 MIME-Version: 1.0 Subject: [FFmpeg-devel] [PATCH] lavc/vorbisdsp: unroll RISC-V V inverse_coupling X-BeenThere: ffmpeg-devel@ffmpeg.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: FFmpeg development discussions and patches List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Reply-To: FFmpeg development discussions and patches Errors-To: ffmpeg-devel-bounces@ffmpeg.org Sender: "ffmpeg-devel" X-TUID: PR68R/ucRTlG This increases the group multiplier as per T-Head C910 benchmarks: inverse_coupling_c: 4597.0 inverse_coupling_rvv_i32: 1312.7 (m1) inverse_coupling_rvv_i32: 1116.7 (m2) inverse_coupling_rvv_i32: 732.2 (m4) inverse_coupling_rvv_i32: 898.0 (m8) --- libavcodec/riscv/vorbisdsp_rvv.S | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/libavcodec/riscv/vorbisdsp_rvv.S b/libavcodec/riscv/vorbisdsp_rvv.S index f45e7dc2f1..81a6c62a65 100644 --- a/libavcodec/riscv/vorbisdsp_rvv.S +++ b/libavcodec/riscv/vorbisdsp_rvv.S @@ -23,7 +23,7 @@ func ff_vorbis_inverse_coupling_rvv, zve32f fmv.w.x ft0, zero 1: - vsetvli t0, a2, e32, m1, ta, ma + vsetvli t0, a2, e32, m4, ta, ma vle32.v v16, (a1) sub a2, a2, t0 vle32.v v24, (a0)