From patchwork Sun May 26 05:34:21 2024 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: =?utf-8?q?R=C3=A9mi_Denis-Courmont?= X-Patchwork-Id: 49266 Delivered-To: ffmpegpatchwork2@gmail.com Received: by 2002:a59:542:0:b0:460:55fa:d5ed with SMTP id 63csp2597729vqf; Sat, 25 May 2024 22:34:31 -0700 (PDT) X-Forwarded-Encrypted: i=2; AJvYcCUu4YJgQ+ngH9r5cWFdJH9Vm+43zF8AYok3QU8idywcZPRqFbdqnXm8Ro9xcxaFRf1KBfmoXkOjhBAMJXlr2pNFHRNyZ3MDIknEHw== X-Google-Smtp-Source: AGHT+IHFfX4yUw4dU7xQx3F6qTwR/rJ9AQa69pgHO3SlgoMMB1k/6pSJpzCfhQ1ie3L2PV/7txjk X-Received: by 2002:a2e:6a02:0:b0:2e5:566:c752 with SMTP id 38308e7fff4ca-2e95b27b11emr49350331fa.48.1716701671515; Sat, 25 May 2024 22:34:31 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1716701671; cv=none; d=google.com; s=arc-20160816; b=pR14GgVQxo6ZOp5XW30xrO89NXFkM4HvX8K/OuBzcojfh5S10KtLUjxi4TwrAN2BQk 1vrcWOJkueSV5FxjLbqZnebPWMK8CH1XQhpoyI+Rwvs2IYOsNHnaMOEWFRs/vgwLHPCp yyRez54Th0jqbfmB9KAqSZXm6oya48JltAXkP4tgpG2xs0i1bh/AfT0tQekuMy8b3nCw axAEOrTCsm+Tx/+lNOBBqKMFq4aJ/daPR5l2StPh3cllpbIk58kxpyEyoj9lGDAnpxiI umszqxsemmIZ9E1LaHtAKHqn+Wxss8uhLHoh9KZnMyp8A2QeXj+K9oUfAs7d2XVkqZA/ 5wsg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=sender:errors-to:content-transfer-encoding:reply-to:list-subscribe :list-help:list-post:list-archive:list-unsubscribe:list-id :precedence:subject:mime-version:message-id:date:to:from :delivered-to; bh=LUSM+Pw3MV/gUDWpBQlU0ARwUh2VGgzc74L9d1Tlo5s=; fh=YOA8vD9MJZuwZ71F/05pj6KdCjf6jQRmzLS+CATXUQk=; b=WNK5yR+zGlpZpNBG8Y1pGNFuh2Wv5pv17tdTl3t8a1Jeu2LaWDKoTpreOvIEWR3t5U cvGuiQxvc4I7ruMHjKJJRxv8H2XsR2FRVjO5jJCF9oYPtMByN7Ce60l7X2k1GIB+hsFO VeUCKBzoplKCn6Np8eXjk/WjGqQVdlhqf2LLPyBVLPIdqhfZg5Hz8gkWE1rB/acvMAET sO7PeHmHNeN29tZVVt/c7w1Qea4DFa8uBPwSPNKpWlG5fC2GKI8/3ZLArCAc3xNHT2K5 5+QRkrh6NvnV5bctKE5w1zO9i4MlOqc3Vke3+/0b5UnmidigoXvOlHK6gc0GHZjIZQ2Z hgoQ==; dara=google.com ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of ffmpeg-devel-bounces@ffmpeg.org designates 79.124.17.100 as permitted sender) smtp.mailfrom=ffmpeg-devel-bounces@ffmpeg.org Return-Path: Received: from ffbox0-bg.mplayerhq.hu (ffbox0-bg.ffmpeg.org. [79.124.17.100]) by mx.google.com with ESMTP id a640c23a62f3a-a626c91842fsi249232966b.227.2024.05.25.22.34.31; Sat, 25 May 2024 22:34:31 -0700 (PDT) Received-SPF: pass (google.com: domain of ffmpeg-devel-bounces@ffmpeg.org designates 79.124.17.100 as permitted sender) client-ip=79.124.17.100; Authentication-Results: mx.google.com; spf=pass (google.com: domain of ffmpeg-devel-bounces@ffmpeg.org designates 79.124.17.100 as permitted sender) smtp.mailfrom=ffmpeg-devel-bounces@ffmpeg.org Received: from [127.0.1.1] (localhost [127.0.0.1]) by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTP id 3619168D432; Sun, 26 May 2024 08:34:28 +0300 (EEST) X-Original-To: ffmpeg-devel@ffmpeg.org Delivered-To: ffmpeg-devel@ffmpeg.org Received: from ursule.remlab.net (vps-a2bccee9.vps.ovh.net [51.75.19.47]) by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTP id 39D3168C704 for ; Sun, 26 May 2024 08:34:22 +0300 (EEST) Received: from basile.remlab.net (localhost [IPv6:::1]) by ursule.remlab.net (Postfix) with ESMTP id 6C63AC006F for ; Sun, 26 May 2024 08:34:21 +0300 (EEST) From: =?utf-8?q?R=C3=A9mi_Denis-Courmont?= To: ffmpeg-devel@ffmpeg.org Date: Sun, 26 May 2024 08:34:21 +0300 Message-ID: <20240526053421.4122-1-remi@remlab.net> X-Mailer: git-send-email 2.45.1 MIME-Version: 1.0 Subject: [FFmpeg-devel] [PATCH] lavc/lpc: optimise RVV vector type for compute_autocorr X-BeenThere: ffmpeg-devel@ffmpeg.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: FFmpeg development discussions and patches List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Reply-To: FFmpeg development discussions and patches Errors-To: ffmpeg-devel-bounces@ffmpeg.org Sender: "ffmpeg-devel" X-TUID: lO2Ro9evbaJ1 On SpacemiT X60 (with len == 4000): autocorr_10_c: 2303.7 autocorr_10_rvv_f64: 1411.5 (before) autocorr_10_rvv_f64: 842.2 (after) --- libavcodec/riscv/lpc_init.c | 3 ++- libavcodec/riscv/lpc_rvv.S | 5 +++-- 2 files changed, 5 insertions(+), 3 deletions(-) diff --git a/libavcodec/riscv/lpc_init.c b/libavcodec/riscv/lpc_init.c index ab91956f2d..f21eca4caa 100644 --- a/libavcodec/riscv/lpc_init.c +++ b/libavcodec/riscv/lpc_init.c @@ -36,7 +36,8 @@ av_cold void ff_lpc_init_riscv(LPCContext *c) if ((flags & AV_CPU_FLAG_RVV_F64) && (flags & AV_CPU_FLAG_RVB_ADDR)) { c->lpc_apply_welch_window = ff_lpc_apply_welch_window_rvv; - if (ff_get_rv_vlenb() >= c->max_order) + if ((flags & AV_CPU_FLAG_RVB_BASIC) && + ff_get_rv_vlenb() >= c->max_order) c->lpc_compute_autocorr = ff_lpc_compute_autocorr_rvv; } #endif diff --git a/libavcodec/riscv/lpc_rvv.S b/libavcodec/riscv/lpc_rvv.S index d4ea515fee..024837102c 100644 --- a/libavcodec/riscv/lpc_rvv.S +++ b/libavcodec/riscv/lpc_rvv.S @@ -86,9 +86,10 @@ func ff_lpc_apply_welch_window_rvv, zve64d ret endfunc -func ff_lpc_compute_autocorr_rvv, zve64d +func ff_lpc_compute_autocorr_rvv, zve64d, zbb + vtype_vli t1, a2, t2, e64, ta, ma li t0, 1 - vsetvli zero, a2, e64, m8, ta, ma + vsetvl zero, a2, t1 fcvt.d.l ft0, t0 vle64.v v0, (a0) sh3add a0, a2, a0 # data += lag