From patchwork Wed Mar 8 10:00:52 2017 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: =?utf-8?q?Martin_Storsj=C3=B6?= X-Patchwork-Id: 2831 Delivered-To: ffmpegpatchwork@gmail.com Received: by 10.103.50.79 with SMTP id y76csp955563vsy; Wed, 8 Mar 2017 02:09:38 -0800 (PST) X-Received: by 10.28.4.10 with SMTP id 10mr23494689wme.142.1488967778796; Wed, 08 Mar 2017 02:09:38 -0800 (PST) Return-Path: Received: from ffbox0-bg.mplayerhq.hu (ffbox0-bg.ffmpeg.org. [79.124.17.100]) by mx.google.com with ESMTP id u74si3679671wrc.274.2017.03.08.02.09.38; Wed, 08 Mar 2017 02:09:38 -0800 (PST) Received-SPF: pass (google.com: domain of ffmpeg-devel-bounces@ffmpeg.org designates 79.124.17.100 as permitted sender) client-ip=79.124.17.100; Authentication-Results: mx.google.com; dkim=neutral (body hash did not verify) header.i=@martin-st.20150623.gappssmtp.com; spf=pass (google.com: domain of ffmpeg-devel-bounces@ffmpeg.org designates 79.124.17.100 as permitted sender) smtp.mailfrom=ffmpeg-devel-bounces@ffmpeg.org Received: from [127.0.1.1] (localhost [127.0.0.1]) by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTP id 6CE9C688313; Wed, 8 Mar 2017 12:09:22 +0200 (EET) X-Original-To: ffmpeg-devel@ffmpeg.org Delivered-To: ffmpeg-devel@ffmpeg.org Received: from mail-lf0-f67.google.com (mail-lf0-f67.google.com [209.85.215.67]) by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTPS id 0DC5368827D for ; Wed, 8 Mar 2017 12:09:21 +0200 (EET) Received: by mail-lf0-f67.google.com with SMTP id y193so2012300lfd.1 for ; Wed, 08 Mar 2017 02:09:34 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=martin-st.20150623.gappssmtp.com; s=20150623; h=from:to:subject:date:message-id:in-reply-to:references; bh=Tntcb11o/V0sqhsHbWgkQiBB9U5DAou07XMfKsbrwbA=; b=r1qJwecp6haFhpQ3WDARWiLgHsSgBrpkIb9J9FldRtDKbD971POQEmGILzcVGUvdVZ InTANsDUtDG60qDnK2ZnPq5sugnWXkStlb8dDGFBc7sMP//ZjUT56gSQWiMoVKqr+Vwx mjJcMrJs3YRSLHtLqAwytUWakL4X+J8y6VqxqBBuQj90z1xNH+0HEXA1y8vPvwyGYuDp 6wHO5wAOxmAHCP1yPZNf2Vj1ml46giLdSQ2UUaA3M4Bb5/9GOnWv+4tlSc+SuuyaMNHm nz9gLHix0rfiPuuxo62TXBwrvQy8h8YE7gpSrafCfN1fz/x2u4NdZYG300QPYNgbW+9r J0bg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:to:subject:date:message-id:in-reply-to :references; bh=Tntcb11o/V0sqhsHbWgkQiBB9U5DAou07XMfKsbrwbA=; b=YN2hu/F6vSub1hPTStk8mRWUYuR6KfpPx0jXzUMZPkn/yUUeYjzFr7F3g1z5GzZW7P +wSVe62ute6t1x5jSEUPB+IyoYFmOZRxydBlMqdglrrm62Il5lsjuOwsIcOEj562G8K8 ggjzMXO/KNK8kpt9Z1GVao8o/9wFK83yJBbDdlko3VVp7b1PvonAZkUrbaIzg0X7EfIG 43EXpQkdXyy/FsFfeF8rbuaS1R2xO/4Id1AH/hknXcVMmhJ0Mk7fST75h0Kh7vLggpRV swjGFDO+L7o/VEZdz53S0mcJjDMyxnwebU5FqCDTx1MXjcaln4jJIX8cxxeMhAwBnpdZ dNQA== X-Gm-Message-State: AMke39kJy7t1J7/Tp6a1fon8jxxvtnqkRD3KsF2+pw8RYG5+s1mfcZTPF/W2CFi+pFUXFg== X-Received: by 10.46.75.9 with SMTP id y9mr1815524lja.62.1488967286135; Wed, 08 Mar 2017 02:01:26 -0800 (PST) Received: from localhost.localdomain ([2001:470:28:852:7d47:68e:13e8:4933]) by smtp.gmail.com with ESMTPSA id m127sm513064lfg.58.2017.03.08.02.01.25 for (version=TLS1_2 cipher=ECDHE-RSA-AES128-SHA bits=128/128); Wed, 08 Mar 2017 02:01:25 -0800 (PST) From: =?UTF-8?q?Martin=20Storsj=C3=B6?= To: ffmpeg-devel@ffmpeg.org Date: Wed, 8 Mar 2017 12:00:52 +0200 Message-Id: <1488967274-8143-12-git-send-email-martin@martin.st> X-Mailer: git-send-email 2.7.4 In-Reply-To: <1488967274-8143-1-git-send-email-martin@martin.st> References: <1488967274-8143-1-git-send-email-martin@martin.st> Subject: [FFmpeg-devel] [PATCH 12/34] aarch64: vp9itxfm: Use the right lane sizes in 8x8 for improved readability X-BeenThere: ffmpeg-devel@ffmpeg.org X-Mailman-Version: 2.1.20 Precedence: list List-Id: FFmpeg development discussions and patches List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Reply-To: FFmpeg development discussions and patches MIME-Version: 1.0 Errors-To: ffmpeg-devel-bounces@ffmpeg.org Sender: "ffmpeg-devel" This is cherrypicked from libav commit 3dd7827258ddaa2e51085d0c677d6f3b1be3572f. --- libavcodec/aarch64/vp9itxfm_neon.S | 16 ++++++++-------- 1 file changed, 8 insertions(+), 8 deletions(-) diff --git a/libavcodec/aarch64/vp9itxfm_neon.S b/libavcodec/aarch64/vp9itxfm_neon.S index e42cc2d..3b34749 100644 --- a/libavcodec/aarch64/vp9itxfm_neon.S +++ b/libavcodec/aarch64/vp9itxfm_neon.S @@ -385,10 +385,10 @@ function ff_vp9_\txfm1\()_\txfm2\()_8x8_add_neon, export=1 .endif ld1 {v0.8h}, [x4] - movi v2.16b, #0 - movi v3.16b, #0 - movi v4.16b, #0 - movi v5.16b, #0 + movi v2.8h, #0 + movi v3.8h, #0 + movi v4.8h, #0 + movi v5.8h, #0 .ifc \txfm1\()_\txfm2,idct_idct cmp w3, #1 @@ -411,11 +411,11 @@ function ff_vp9_\txfm1\()_\txfm2\()_8x8_add_neon, export=1 b 2f .endif 1: - ld1 {v16.16b,v17.16b,v18.16b,v19.16b}, [x2], #64 - ld1 {v20.16b,v21.16b,v22.16b,v23.16b}, [x2], #64 + ld1 {v16.8h,v17.8h,v18.8h,v19.8h}, [x2], #64 + ld1 {v20.8h,v21.8h,v22.8h,v23.8h}, [x2], #64 sub x2, x2, #128 - st1 {v2.16b,v3.16b,v4.16b,v5.16b}, [x2], #64 - st1 {v2.16b,v3.16b,v4.16b,v5.16b}, [x2], #64 + st1 {v2.8h,v3.8h,v4.8h,v5.8h}, [x2], #64 + st1 {v2.8h,v3.8h,v4.8h,v5.8h}, [x2], #64 \txfm1\()8