From patchwork Wed Oct 5 16:12:54 2022 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: =?utf-8?q?R=C3=A9mi_Denis-Courmont?= X-Patchwork-Id: 38564 Delivered-To: ffmpegpatchwork2@gmail.com Received: by 2002:a05:6a20:4d9:b0:9c:f4b:4e41 with SMTP id 25csp700747pzd; Wed, 5 Oct 2022 09:13:03 -0700 (PDT) X-Google-Smtp-Source: AMsMyM4k9oz0dmOnqMM9UEsEGpiHcoQmNaqWZbq7CD4Aq+QxWT9bGDfM0MWY8yDkk15+HMkFPiPU X-Received: by 2002:a17:907:160d:b0:782:bc5d:162e with SMTP id hb13-20020a170907160d00b00782bc5d162emr275398ejc.291.1664986383338; Wed, 05 Oct 2022 09:13:03 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1664986383; cv=none; d=google.com; s=arc-20160816; b=pfvjjvLY5wHkn9OsN99O/3VMt13MdueC+mdtJoGVG24Mtj7d3lgusFVwXMmqzwxN2+ UpkwtAO6w8Uf8+1BHd829unvifNedtyQiEvNIx5Snz3WyZc2OzQR207983L0yxsBslQ/ RgDAvBdu2XMGkPh6oujHsejOADMUoJ494rFXSYs1fnote7o/T2J9rFsZMkxdNETpjZiU 1iO+/okut8RsyVydlDqmwBOUtzcK71wTShsR3MSORGvskKzs/UJozHX9aJaHcryTwvnn faVvyhNOKNtFoSycrAoTrw+9bP0QRan4W8tuc4vKVVdqMfO0DbMv+LDIAAmdOXgkPFru SStg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=sender:errors-to:content-transfer-encoding:reply-to:list-subscribe :list-help:list-post:list-archive:list-unsubscribe:list-id :precedence:subject:mime-version:references:in-reply-to:message-id :date:to:from:delivered-to; bh=sLsjrN7p4tQkxgFGoPWacca7rv/HMum5I5NMbeO9VA8=; b=f+WWbOdr3eUiV//UWCesyLwlXqMZfUDltFC63n77xNRpc3UE8zAAyvAifU1+UzfhdT fOevkCTtt/B6yabsooLiINJ2ROmsd+/8TAz5x9cfjBCdBqF+NAWpH4SeEh+oAfcet5ob uhx+f99C9LirDepoDH/LlSjCnHQ/rKzDNLRpDTt6rJqkwfkn58+xzQHUPug1w1Hv1JQ5 ve2AogKTaeHbx5zEFy3rdvq5GU96QsXacfoDrg41IaEH4b0CSgf8R8jkkMibwggPTrS6 43cb1BKw6rHxUjXJR70H9906FZRxrPzYsrHvCvUP5fPt11os09UZ66MZUbCPBvlR4Jlq BuJw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of ffmpeg-devel-bounces@ffmpeg.org designates 79.124.17.100 as permitted sender) smtp.mailfrom=ffmpeg-devel-bounces@ffmpeg.org Return-Path: Received: from ffbox0-bg.mplayerhq.hu (ffbox0-bg.ffmpeg.org. [79.124.17.100]) by mx.google.com with ESMTP id hq11-20020a1709073f0b00b007417e9a2c71si14965292ejc.352.2022.10.05.09.13.02; Wed, 05 Oct 2022 09:13:03 -0700 (PDT) Received-SPF: pass (google.com: domain of ffmpeg-devel-bounces@ffmpeg.org designates 79.124.17.100 as permitted sender) client-ip=79.124.17.100; Authentication-Results: mx.google.com; spf=pass (google.com: domain of ffmpeg-devel-bounces@ffmpeg.org designates 79.124.17.100 as permitted sender) smtp.mailfrom=ffmpeg-devel-bounces@ffmpeg.org Received: from [127.0.1.1] (localhost [127.0.0.1]) by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTP id 28BDB68BD15; Wed, 5 Oct 2022 19:13:00 +0300 (EEST) X-Original-To: ffmpeg-devel@ffmpeg.org Delivered-To: ffmpeg-devel@ffmpeg.org Received: from ursule.remlab.net (vps-a2bccee9.vps.ovh.net [51.75.19.47]) by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTP id C185B68BD11 for ; Wed, 5 Oct 2022 19:12:57 +0300 (EEST) Received: from basile.remlab.net (localhost [IPv6:::1]) by ursule.remlab.net (Postfix) with ESMTP id 5E571C0099 for ; Wed, 5 Oct 2022 19:12:57 +0300 (EEST) From: =?utf-8?q?R=C3=A9mi_Denis-Courmont?= To: ffmpeg-devel@ffmpeg.org Date: Wed, 5 Oct 2022 19:12:54 +0300 Message-Id: <20221005161256.27612-2-remi@remlab.net> X-Mailer: git-send-email 2.37.2 In-Reply-To: <12083658.O9o76ZdvQC@basile.remlab.net> References: <12083658.O9o76ZdvQC@basile.remlab.net> MIME-Version: 1.0 Subject: [FFmpeg-devel] [PATCH 2/4] lavu/riscv: helper macro for VTYPE encoding X-BeenThere: ffmpeg-devel@ffmpeg.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: FFmpeg development discussions and patches List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Reply-To: FFmpeg development discussions and patches Errors-To: ffmpeg-devel-bounces@ffmpeg.org Sender: "ffmpeg-devel" X-TUID: hsIaSxKhzxfl On most cases, the vector type (VTYPE) for the RISC-V Vector extension is supplied as an immediate value, with either of the VSETVLI or VSETIVLI instructions. There is however a third instruction VSETVL which takes the vector type from a general purpose register. That is so the type can be selected at run-time. This introduces a macro to load a (valid) vector type into a register. The syntax follows that of VSETVLI and VSETIVLI, with element size, group multiplier, then tail and mask policies. --- libavutil/riscv/asm.S | 75 +++++++++++++++++++++++++++++++++++++++++++ 1 file changed, 75 insertions(+) diff --git a/libavutil/riscv/asm.S b/libavutil/riscv/asm.S index ffa0bd9068..6ca74f263a 100644 --- a/libavutil/riscv/asm.S +++ b/libavutil/riscv/asm.S @@ -92,3 +92,78 @@ shnadd 3, \rd, \rs1, \rs2 .endm #endif + + /* Convenience macro to load a Vector type (vtype) as immediate */ + .macro lvtypei rd, e, m=m1, tp=tu, mp=mu + + .ifc \e,e8 + .equ ei, 0 + .else + .ifc \e,e16 + .equ ei, 8 + .else + .ifc \e,e32 + .equ ei, 16 + .else + .ifc \e,e64 + .equ ei, 24 + .else + .error "Unknown element type" + .endif + .endif + .endif + .endif + + .ifc \m,m1 + .equ mi, 0 + .else + .ifc \m,m2 + .equ mi, 1 + .else + .ifc \m,m4 + .equ mi, 2 + .else + .ifc \m,m8 + .equ mi, 3 + .else + .ifc \m,mf8 + .equ mi, 5 + .else + .ifc \m,mf4 + .equ mi, 6 + .else + .ifc \m,mf2 + .equ mi, 7 + .else + .error "Unknown multiplier" + .equ mi, 3 + .endif + .endif + .endif + .endif + .endif + .endif + .endif + + .ifc \tp,tu + .equ tpi, 0 + .else + .ifc \tp,ta + .equ tpi, 64 + .else + .error "Unknown tail policy" + .endif + .endif + + .ifc \mp,mu + .equ mpi, 0 + .else + .ifc \mp,ma + .equ mpi, 128 + .else + .error "Unknown mask policy" + .endif + .endif + + li \rd, (ei | mi | tpi | mpi) + .endm