From patchwork Fri Feb 24 12:31:29 2023 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: =?utf-8?b?5b6Q56aP6ZqG?= <839789740@qq.com> X-Patchwork-Id: 40492 Delivered-To: ffmpegpatchwork2@gmail.com Received: by 2002:a05:6a20:5494:b0:bf:7b3a:fd32 with SMTP id i20csp1200645pzk; Fri, 24 Feb 2023 04:31:50 -0800 (PST) X-Google-Smtp-Source: AK7set9cMdnsntftdURhJ/Oa5TwQiLtw4eGFv+pINM0hrVChWZUz06pzwES9mtMO+olzE6Jv8ano X-Received: by 2002:a17:906:b751:b0:8b1:77bf:3bdd with SMTP id fx17-20020a170906b75100b008b177bf3bddmr28830564ejb.36.1677241910701; Fri, 24 Feb 2023 04:31:50 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1677241910; cv=none; d=google.com; s=arc-20160816; b=V/uaQhjq7zfKvB1kIpou18n71SwGkSnTL36MVJZ/WyBrw7Ntfdsjv4doByJ8FlTYqM cpO7bBU4bpnvZckup3o988v21cdIhkamKxqK7rK3D/OEZstoU/AczUoMrW1i80LRBYI1 SCwcmZTqKhC0vTkDIToc10UxtSiDu8bSeaXLFbLh+jnFDj/gfZjl2ilq+NUoEY4mWz0g 560SHkFeU5nyIo3EMnV8zq9+ItvQguCb3sw+KvYU+cMV2u0Jv8f/4A3rUHBKedsJDE5x 8sA1X7esg2aSdcP5xaXXjt1Zrg/pmvSV9AWPXOXqhqEUW8tsH9euD2b2PrlHVzFQuP2U njtA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=sender:errors-to:content-transfer-encoding:cc:reply-to :list-subscribe:list-help:list-post:list-archive:list-unsubscribe :list-id:precedence:subject:mime-version:date:to:from:message-id :dkim-signature:delivered-to; bh=zcT/FgrM1WZS2iJFWDgc8wb23vkDxe16YVtURgefvWE=; b=RWiltc8oiguDKuj+FsbYJGf/Z5W29Wm2JQJawE0UMfWTNfIdMvOORWxRUcqI3Rvn1Q X4IGXd1XHLfx3xoNAKovL7Th9p8eRr7MF+Mgnh6I0CJqzGav3mwBaNkCAOmBQ3vGNyib dMPEJitenG1dddi3m+5giafuiO4gg1BPHcS05jQWDNq9YU0Z/eQX7xzzxBLLKIdFShFJ hV90eVy58K4VZdk2fZZtpmPxAqzWSiMZyco1wk1DL0Wo8P9/rAlM6G3xB090QmW7Nno/ vWLFFTp61qICvlQE8cvRGGl05S/H+Gpi3NG1dJpznnpvZusBiePaM6D0wN9QZon51jRL vdHw== ARC-Authentication-Results: i=1; mx.google.com; dkim=neutral (body hash did not verify) header.i=@qq.com header.s=s201512 header.b=UTaFvlg2; spf=pass (google.com: domain of ffmpeg-devel-bounces@ffmpeg.org designates 79.124.17.100 as permitted sender) smtp.mailfrom=ffmpeg-devel-bounces@ffmpeg.org; dmarc=fail (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=qq.com Return-Path: Received: from ffbox0-bg.mplayerhq.hu (ffbox0-bg.ffmpeg.org. [79.124.17.100]) by mx.google.com with ESMTP id k18-20020aa7c392000000b004ab0d0a4759si4847227edq.410.2023.02.24.04.31.49; Fri, 24 Feb 2023 04:31:50 -0800 (PST) Received-SPF: pass (google.com: domain of ffmpeg-devel-bounces@ffmpeg.org designates 79.124.17.100 as permitted sender) client-ip=79.124.17.100; Authentication-Results: mx.google.com; dkim=neutral (body hash did not verify) header.i=@qq.com header.s=s201512 header.b=UTaFvlg2; spf=pass (google.com: domain of ffmpeg-devel-bounces@ffmpeg.org designates 79.124.17.100 as permitted sender) smtp.mailfrom=ffmpeg-devel-bounces@ffmpeg.org; dmarc=fail (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=qq.com Received: from [127.0.1.1] (localhost [127.0.0.1]) by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTP id 621CF68BF0E; Fri, 24 Feb 2023 14:31:45 +0200 (EET) X-Original-To: ffmpeg-devel@ffmpeg.org Delivered-To: ffmpeg-devel@ffmpeg.org Received: from out203-205-221-153.mail.qq.com (out203-205-221-153.mail.qq.com [203.205.221.153]) by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTPS id AD24C68BD12 for ; Fri, 24 Feb 2023 14:31:37 +0200 (EET) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=qq.com; s=s201512; t=1677241893; bh=ebwWqPVlNTV8dS12m4ad/hzffMEZKlEmrk6tf6smULU=; h=From:To:Cc:Subject:Date; b=UTaFvlg2785XpUbx9kLRYFGQyZ1nhjzCrQo1ht8SPhtTgKfKApbW6OapoZNTn28jH I/+XIzXT+Kd1if+J5fw8NQaZNACLxxcu5lOPJaLWNxSEEye721Lu+08cFFyOVLjRGI S9fdQey1ayCl3Q9uEZZb6MEa3sX9ZbzR0uSfypko= Received: from localhost.localdomain ([113.65.131.200]) by newxmesmtplogicsvrsza10-0.qq.com (NewEsmtp) with SMTP id 7E08DCC9; Fri, 24 Feb 2023 20:31:32 +0800 X-QQ-mid: xmsmtpt1677241892tp8fi42m3 Message-ID: X-QQ-XMAILINFO: OZZSS56D9fAjnjvW0PeBVkjoCV4zbDuCAy3Tv8/3bYo4RbLUEbnQce3DMs8CJ0 ojRRCiZK3dJlEFHemiNHuc9x4H1ffpaxs0ra+JMX9waf40RGz5d+IdcE24uSFzgBMAam3sf+yT/M hrRRMJFQ/x6M2DJdWlt55dDtnt/1S3drf/qXG0YlSKrRfk/8falpdrpgxFnY4sO7f9fV5KonNOKQ yjqxXdgYSCnkyXd0dMHDXTgidsKbiijjC8WilAq4dB/cOM8pRbnwPjTrv3QRw3HO/VX76cg4YAKF Z7wIjuCniCOlrFg/J1BJoyvjM8MCjVTvSYWvD08qYl9NTC8znA+DklyvAlQTtXisIsmLEk6iEoZV MapFpMfAn1sT/RqpijYyk2rubO8zXQZUCtPKm7CpKasOLhhdVGu8Jvy+c79q0JH5+54kUeQUSQCB 4Mc7bl+ePYOzmaYG2oUzTJrBqlzFAZS7QrhtKTJVMToEIEe7YvMuBTJU+hsXXuBe1xbTwSA/hdVI EV9pYykF7xaELk4SObVdlJqUPpgRWS474OiELExJRgDnQ+ddakYAu8MRdtU80UNr4HkpNznR2249 tQxCU0FTxrsnq8czC+yBWThyD3xGurcVrafIoIsAugLhHZ/LOBc+NQF7iUFQBiDNBJDE+WSWfGzx yemf/V67CfuFH4eghUFIZi8gPYL40KPFj3zANln8fe8moEDRx+EJgzSCVRT8yUIt9Zc7XX+yc1oj 3wXsTnw19QUATcllTvWBhgIEYAaCqVHJy947SHjZMxb9nfB/pVYm4bM9WrdOg86H5kVrclxp7M8B nOdNu85fshPFb30EeHYI5ZRl4mxE8KC+Wlm539WNwb0Ox1LE+wlZGirCf6gx1sxy5hXdRkDORaw/ W1/y4laEQIcd+dFxR+pJ+Og55jGVo/mvLtG+dLz1JvLJqcpA3ivZFG2X4ZF4MkuUtrEJYm9+25/t hymzHV49JF7cK4EfaduueVcCk1ATsIRitGy/zpLso= From: xufuji456 <839789740@qq.com> To: ffmpeg-devel@ffmpeg.org Date: Fri, 24 Feb 2023 20:31:29 +0800 X-OQ-MSGID: <20230224123129.12041-1-839789740@qq.com> X-Mailer: git-send-email 2.32.0 (Apple Git-132) MIME-Version: 1.0 Subject: [FFmpeg-devel] [PATCH] libavcodec/hevc: reuse scale_store on idct32x32_neon X-BeenThere: ffmpeg-devel@ffmpeg.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: FFmpeg development discussions and patches List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Reply-To: FFmpeg development discussions and patches Cc: xufuji456 <839789740@qq.com> Errors-To: ffmpeg-devel-bounces@ffmpeg.org Sender: "ffmpeg-devel" X-TUID: tJpx0A3KmKzZ --- libavcodec/arm/hevcdsp_idct_neon.S | 18 ++---------------- 1 file changed, 2 insertions(+), 16 deletions(-) diff --git a/libavcodec/arm/hevcdsp_idct_neon.S b/libavcodec/arm/hevcdsp_idct_neon.S index 75795e6a6a..41ca3b83a8 100644 --- a/libavcodec/arm/hevcdsp_idct_neon.S +++ b/libavcodec/arm/hevcdsp_idct_neon.S @@ -876,28 +876,14 @@ function func_tr_32x4_\name movrel r9, trans + 32 vld1.s16 {q0}, [r9, :128]! vld1.s16 {q1}, [r9, :128] - - bl tr_block1 - add r4, sp, #2048 - vld1.s16 {q14-q15}, [r4, :128]! - butterfly32 q14, q10, q15, q11 - scale32 d22, d23, d20, d21, q1, q14, q10, q15, \shift - - vld1.s16 {q14-q15}, [r4, :128]! - butterfly32 q14, q12, q15, q13 - scale32 d2, d3, d28, d29, q1, q14, q12, q15, \shift - transpose8_4x4 d22, d20, d2, d28 - transpose8_4x4 d29, d3, d21, d23 + bl tr_block1 mov r1, r11 mov r2, #64 mov r8, #-64 add r3, r11, #(56 + 3 * 64) - store16 d22, d23, d20, d21, d2, d3, d28, d29, r8 - - @ reload multiplication coefficiens to q1 - vld1.s16 {q1}, [r9, :128] + scale_store \shift bl tr_block2 add r1, r11, #8