From patchwork Thu Aug 19 21:07:59 2021 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Mikhail Nitenko X-Patchwork-Id: 29621 Delivered-To: ffmpegpatchwork2@gmail.com Received: by 2002:a05:6602:2a4a:0:0:0:0 with SMTP id k10csp644670iov; Thu, 19 Aug 2021 14:08:28 -0700 (PDT) X-Google-Smtp-Source: ABdhPJzW7c3PZ8GKsc7XmqBev2cL8du8qEkOMa1wUyR6M6VZsrNXpnFrEVFZx9LbyvXq40ijSWVm X-Received: by 2002:a05:6402:1157:: with SMTP id g23mr18795053edw.90.1629407308220; Thu, 19 Aug 2021 14:08:28 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1629407308; cv=none; d=google.com; s=arc-20160816; b=Y215JYsyrIQF6V/evzyzf8vb56L6SN/blcN83st+z1M94weHSLAed6AAPy/ZY8zw/Z ufoCzJ0+/eyJQFsvFRaNW4QHKRoIB1thY4A8MAIPhPBb6Es6/VK0l4B06dAmvpa9H6QC Rmw1uU7dslgEs6cVQS2F23/hAunt0AsbJ/jSW8mUtgOhFOUgToVcHkIKNjJ/QmAkpSUf axN36ZoKT9NHUmnk6UNQrs5HIw9zoi/Zs+Qa7AHY43Sk6F699U3Curco0WcpyREox4qJ APc3lECcK5RlLV0ua/qzUrkAgisN5MxT570KXDobKWhzw+JjexXJM0KJhX6UPeLb92IC zmIA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=sender:errors-to:content-transfer-encoding:cc:reply-to :list-subscribe:list-help:list-post:list-archive:list-unsubscribe :list-id:precedence:subject:mime-version:message-id:date:to:from :dkim-signature:delivered-to; bh=BnzWk55vxaM6sJ+VoQwhS4bcnZA0xWXEJwYRvCkbKcE=; b=NeVx8bJ0ItMZgIKx4d85xr9EGGxKM5x+iLlVDhNpdkMu5EuGxWwKCoYxLBViQNhc4W 2L1AD7GF6fZ8kOyQgMLdsUuK7hGU2h6ODymLo4iBr9O7Z9g0Fkp28PsoWJfBwEY/vcb7 Plno2v+vQheDokabf233aG+LnL7DrphENVntbTgGntNgKmUq/DiSjjrt2T0IJ2W3IhWq c5AXSKBRJ2c1n9Xyhzo08rGfDLwBP9/euNRhKosAITZ3U02Gw6mOv0GFXJ4++8aIfEA1 J6pPezTNSu5PQHGDSOn6NKiav15noiIVLZF0FV9LvzKqBzcM/QI2EHesG3ioVp1vP0YI T3Uw== ARC-Authentication-Results: i=1; mx.google.com; dkim=neutral (body hash did not verify) header.i=@gmail.com header.s=20161025 header.b="S/cX76da"; spf=pass (google.com: domain of ffmpeg-devel-bounces@ffmpeg.org designates 79.124.17.100 as permitted sender) smtp.mailfrom=ffmpeg-devel-bounces@ffmpeg.org; dmarc=fail (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from ffbox0-bg.mplayerhq.hu (ffbox0-bg.ffmpeg.org. [79.124.17.100]) by mx.google.com with ESMTP id w17si4283204ejv.123.2021.08.19.14.08.15; Thu, 19 Aug 2021 14:08:28 -0700 (PDT) Received-SPF: pass (google.com: domain of ffmpeg-devel-bounces@ffmpeg.org designates 79.124.17.100 as permitted sender) client-ip=79.124.17.100; Authentication-Results: mx.google.com; dkim=neutral (body hash did not verify) header.i=@gmail.com header.s=20161025 header.b="S/cX76da"; spf=pass (google.com: domain of ffmpeg-devel-bounces@ffmpeg.org designates 79.124.17.100 as permitted sender) smtp.mailfrom=ffmpeg-devel-bounces@ffmpeg.org; dmarc=fail (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: from [127.0.1.1] (localhost [127.0.0.1]) by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTP id 03A2568A176; Fri, 20 Aug 2021 00:08:12 +0300 (EEST) X-Original-To: ffmpeg-devel@ffmpeg.org Delivered-To: ffmpeg-devel@ffmpeg.org Received: from mail-lf1-f44.google.com (mail-lf1-f44.google.com [209.85.167.44]) by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTPS id 5FE0E680CEA for ; Fri, 20 Aug 2021 00:08:06 +0300 (EEST) Received: by mail-lf1-f44.google.com with SMTP id y34so15722063lfa.8 for ; Thu, 19 Aug 2021 14:08:06 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=from:to:cc:subject:date:message-id:mime-version :content-transfer-encoding; bh=l0jbFd16MpiqLc+Si7Jp+Mj5AThrZ+ehoEHftFz1v0o=; b=S/cX76da6dqFQwEGWowAMJkCX9EDqCskks9EWzRy2fK1q8D6EqoPunJlLzpL6kdyCG HqI3+jbqAdSJ4POQfa4Z5NezJVYmoXPWYFQ7n0QUEifoGq0OpfaKuDVzfTOnAX2SDsTP 1vErDmFV7M+EB777WmXpdKSE+/XvxJaglH37Z2gEH0yaBlCQapeZtaARHSj/nfjLUViX W383mWXFgpbDpWUfgqLDahUY9YQfe7/l3iocHXysIKXiFS1ZDSp1tO5gCTfdaRPYcSxK 5mQSfhXfIB8CGNVkl9C12qnoi/zuO6hqsdGhGyu8QA0ti9ur0ZALvLIEEqCzu06LSgxk eTYg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:to:cc:subject:date:message-id:mime-version :content-transfer-encoding; bh=l0jbFd16MpiqLc+Si7Jp+Mj5AThrZ+ehoEHftFz1v0o=; b=D2wNTmjTiEJvd+5AHCfsy4w8jeIBltKsAUpDIsJBNFrHTsuJVAbyqjDbx+r9TkibJC QKlhs68SrR5NUWjtlBpVdPt3ZSnBrgL4a4lQuT1fd0EJe75UXp8gN07vX/ifOVA2aagH G0J7+kOtpevW8/b54JOevojRjh2xyR3s2UnWIRb5QQJKG1bYXR6NUpK+jneXKTIBKFAB 2Wg5EMkdJnzbofs4ecWBIBcyo0lkGMGWFUukxq8NGBanRdrzECWrgmDVXmwr0RI3BpLy TMQBgyPoC27R9POX4C4tD9gRke1KFY6qmut8VvaUgs0vYJozXGleaZipSSpId5r6PNBk sTDQ== X-Gm-Message-State: AOAM531LYFtUOF7QNawG2A0zJWm95Srx7cV8sqCLxxW9oJBuSITajYKS zwYRI2oYWhgOo+0fuzG82DV3PMrTd0Zgvg== X-Received: by 2002:a19:c112:: with SMTP id r18mr11599589lff.531.1629407285378; Thu, 19 Aug 2021 14:08:05 -0700 (PDT) Received: from localhost.localdomain ([213.87.146.53]) by smtp.gmail.com with ESMTPSA id c3sm361247ljj.77.2021.08.19.14.08.04 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Thu, 19 Aug 2021 14:08:04 -0700 (PDT) From: Mikhail Nitenko To: ffmpeg-devel@ffmpeg.org Date: Fri, 20 Aug 2021 00:07:59 +0300 Message-Id: <20210819210800.595496-1-mnitenko@gmail.com> X-Mailer: git-send-email 2.32.0 MIME-Version: 1.0 Subject: [FFmpeg-devel] [PATCH 1/2] lavc/aarch64: move transpose_4x8H to neon.S X-BeenThere: ffmpeg-devel@ffmpeg.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: FFmpeg development discussions and patches List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Reply-To: FFmpeg development discussions and patches Cc: Mikhail Nitenko Errors-To: ffmpeg-devel-bounces@ffmpeg.org Sender: "ffmpeg-devel" X-TUID: ujyBXgkGgixj transpose_4x8H was declared in vp9lpf_16bpp_neon, however this macro is not unique to vp9 and could be used elsewhere. Signed-off-by: Mikhail Nitenko --- libavcodec/aarch64/neon.S | 13 +++++++++++++ libavcodec/aarch64/vp9lpf_16bpp_neon.S | 12 ------------ 2 files changed, 13 insertions(+), 12 deletions(-) diff --git a/libavcodec/aarch64/neon.S b/libavcodec/aarch64/neon.S index 0fddbecae3..1ad32c359d 100644 --- a/libavcodec/aarch64/neon.S +++ b/libavcodec/aarch64/neon.S @@ -109,12 +109,25 @@ trn2 \r5\().4H, \r0\().4H, \r1\().4H trn1 \r6\().4H, \r2\().4H, \r3\().4H trn2 \r7\().4H, \r2\().4H, \r3\().4H + trn1 \r0\().2S, \r4\().2S, \r6\().2S trn2 \r2\().2S, \r4\().2S, \r6\().2S trn1 \r1\().2S, \r5\().2S, \r7\().2S trn2 \r3\().2S, \r5\().2S, \r7\().2S .endm +.macro transpose_4x8H r0, r1, r2, r3, t4, t5, t6, t7 + trn1 \t4\().8H, \r0\().8H, \r1\().8H + trn2 \t5\().8H, \r0\().8H, \r1\().8H + trn1 \t6\().8H, \r2\().8H, \r3\().8H + trn2 \t7\().8H, \r2\().8H, \r3\().8H + + trn1 \r0\().4S, \t4\().4S, \t6\().4S + trn2 \r2\().4S, \t4\().4S, \t6\().4S + trn1 \r1\().4S, \t5\().4S, \t7\().4S + trn2 \r3\().4S, \t5\().4S, \t7\().4S +.endm + .macro transpose_8x8H r0, r1, r2, r3, r4, r5, r6, r7, r8, r9 trn1 \r8\().8H, \r0\().8H, \r1\().8H trn2 \r9\().8H, \r0\().8H, \r1\().8H diff --git a/libavcodec/aarch64/vp9lpf_16bpp_neon.S b/libavcodec/aarch64/vp9lpf_16bpp_neon.S index 9075f3d406..9869614a29 100644 --- a/libavcodec/aarch64/vp9lpf_16bpp_neon.S +++ b/libavcodec/aarch64/vp9lpf_16bpp_neon.S @@ -22,18 +22,6 @@ #include "neon.S" -.macro transpose_4x8H r0, r1, r2, r3, t4, t5, t6, t7 - trn1 \t4\().8h, \r0\().8h, \r1\().8h - trn2 \t5\().8h, \r0\().8h, \r1\().8h - trn1 \t6\().8h, \r2\().8h, \r3\().8h - trn2 \t7\().8h, \r2\().8h, \r3\().8h - - trn1 \r0\().4s, \t4\().4s, \t6\().4s - trn2 \r2\().4s, \t4\().4s, \t6\().4s - trn1 \r1\().4s, \t5\().4s, \t7\().4s - trn2 \r3\().4s, \t5\().4s, \t7\().4s -.endm - // The input to and output from this macro is in the registers v16-v31, // and v0-v7 are used as scratch registers. // p7 = v16 .. p3 = v20, p0 = v23, q0 = v24, q3 = v27, q7 = v31