From patchwork Thu Jun 15 10:36:41 2023
From: Peiting Shen
To: ffmpeg-devel@ffmpeg.org
Date: Thu, 15 Jun 2023 10:36:41 +0000
Message-Id: <20230615103645.25778-3-shenpeiting@eswincomputing.com>
In-Reply-To: <20230615103645.25778-1-shenpeiting@eswincomputing.com>
References: <20230615103645.25778-1-shenpeiting@eswincomputing.com>
Subject: [FFmpeg-devel] [PATCH 2/6] lavc/ac3dsp: RISC-V V float_to_fixed24

From: Shen Peiting

Replace the scalar float-to-fixed conversion with RISC-V vector
instructions.

Benchmarks on Spike (cycles):
len=16
float_to_fixed24_c:   315
float_to_fixed24_rvv:  27
len=160
float_to_fixed24_c:   2871
float_to_fixed24_rvv:   67

Co-authored-by: Yang Xiaojun
Co-authored-by: Huang Xing
Co-authored-by: Zeng Fanchen
Signed-off-by: Shen Peiting
---
 libavcodec/riscv/ac3dsp_init.c |  3 +++
 libavcodec/riscv/ac3dsp_rvv.S  | 19 +++++++++++++++++++
 2 files changed, 22 insertions(+)

diff --git a/libavcodec/riscv/ac3dsp_init.c b/libavcodec/riscv/ac3dsp_init.c
index bb67d86998..a4e75a7541 100644
--- a/libavcodec/riscv/ac3dsp_init.c
+++ b/libavcodec/riscv/ac3dsp_init.c
@@ -25,13 +25,16 @@
 #include "config.h"
 
 void ff_ac3_exponent_min_rvv(uint8_t *exp, int num_reuse_blocks, int nb_coefs);
+void ff_float_to_fixed24_rvv(int32_t *dst, const float *src, unsigned int len);
 
 av_cold void ff_ac3dsp_init_riscv(AC3DSPContext *c)
 {
     int flags = av_get_cpu_flags();
 
 #if HAVE_RVV
     if (flags & AV_CPU_FLAG_RVV_I32)
         c->ac3_exponent_min = ff_ac3_exponent_min_rvv;
+    if (flags & AV_CPU_FLAG_RVV_F32)
+        c->float_to_fixed24 = ff_float_to_fixed24_rvv;
 #endif
 }
diff --git a/libavcodec/riscv/ac3dsp_rvv.S b/libavcodec/riscv/ac3dsp_rvv.S
index 879123f4a7..d98e72c12c 100644
--- a/libavcodec/riscv/ac3dsp_rvv.S
+++ b/libavcodec/riscv/ac3dsp_rvv.S
@@ -44,3 +44,22 @@ func ff_ac3_exponent_min_rvv, zve32x
 3:
         ret
 endfunc
+
+
+func ff_float_to_fixed24_rvv, zve32f
+        addi        t1, x0, 1
+        slli        t1, t1, 24
+        fcvt.s.w    f1, t1
+1:
+        vsetvli     t0, a2, e32, m8
+        vle32.v     v0, (a1)
+        vfmul.vf    v0, v0, f1
+        vfcvt.x.f.v v16, v0
+        vse32.v     v16, (a0)
+        sub         a2, a2, t0
+        slli        t0, t0, 2
+        add         a1, a1, t0
+        add         a0, a0, t0
+        bgtz        a2, 1b
+        ret
+endfunc
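
For readers following the assembly, the loop can be modeled in portable C. The sketch below is an illustration only. It is not part of the patch and not FFmpeg's C fallback: the names float_to_fixed24_model and round_nearest_even and the vlmax parameter are hypothetical. It assumes that vfcvt.x.f.v runs with the default round-to-nearest-even mode and that src[i] * 2^24 fits in int32_t (the vector instruction saturates out-of-range values, which this model does not reproduce). The choice of vl is also simplified to min(len, vlmax).

```c
#include <stdint.h>

/* Hypothetical helper: round to nearest, ties to even, written out by hand
 * so that the sketch does not depend on libm. Assumes |x| < 2^31. */
static int32_t round_nearest_even(float x)
{
    int32_t t = (int32_t)x;      /* truncates toward zero */
    float frac = x - (float)t;   /* exact within the assumed range */

    if (frac > 0.5f || (frac == 0.5f && (t & 1)))
        t++;
    else if (frac < -0.5f || (frac == -0.5f && (t & 1)))
        t--;
    return t;
}

/* Hypothetical scalar model of the strip-mined vector loop.
 * vlmax stands in for the hardware vector length and must be nonzero. */
void float_to_fixed24_model(int32_t *dst, const float *src,
                            unsigned int len, unsigned int vlmax)
{
    const float scale = (float)(1 << 24);   /* addi + slli + fcvt.s.w */

    while (len > 0) {
        /* vsetvli: process at most vlmax elements in this pass */
        unsigned int vl = len < vlmax ? len : vlmax;

        /* vle32.v, vfmul.vf, vfcvt.x.f.v, vse32.v on vl elements */
        for (unsigned int i = 0; i < vl; i++)
            dst[i] = round_nearest_even(src[i] * scale);

        /* sub + add/add: advance past the elements just handled */
        len -= vl;
        src += vl;
        dst += vl;
    }
}
```

With vlmax = 4 and len = 6 the model makes two passes (4 elements, then 2), which mirrors how the vector loop handles a tail without a separate scalar epilogue.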