From patchwork Mon Oct 3 14:10:16 2022
X-Patchwork-Submitter: Grzegorz Bernacki
X-Patchwork-Id: 38535
From: Grzegorz Bernacki
To: ffmpeg-devel@ffmpeg.org
Date: Mon, 3 Oct 2022 16:10:16 +0200
Message-Id: <20221003141020.3564715-4-gjb@semihalf.com>
In-Reply-To: <20221003141020.3564715-1-gjb@semihalf.com>
References: <20221003141020.3564715-1-gjb@semihalf.com>
Subject: [FFmpeg-devel] [PATCH v2 3/7] aarch64: me_cmp: Fix up the prologue of ff_pix_abs8_xy2_neon
Cc: gjb@semihalf.com, upstream@semihalf.com, jswinney@amazon.com,
 hum@semihalf.com, martin@martin.st, mw@semihalf.com, spop@amazon.com

From: Martin Storsjö

This initializes things properly if this were to be called with h < 4.
---
 libavcodec/aarch64/me_cmp_neon.S | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/libavcodec/aarch64/me_cmp_neon.S b/libavcodec/aarch64/me_cmp_neon.S
index 3662419edf..cfba3eb33a 100644
--- a/libavcodec/aarch64/me_cmp_neon.S
+++ b/libavcodec/aarch64/me_cmp_neon.S
@@ -245,12 +245,12 @@ function ff_pix_abs8_xy2_neon, export=1
         cmp             w4, #4
         add             x6, x5, 1 // pix3 + stride + 1
 
-        b.lt            2f
-
         ld1             {v0.8b}, [x2], x3
         ld1             {v1.8b}, [x0], x3
         uaddl           v2.8h, v0.8b, v1.8b
 
+        b.lt            2f
+
 // make 4 iterations at once
 1:
         ld1             {v4.8b}, [x5], x3
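
For readers who want the control flow in scalar form, here is a rough C sketch of why the prologue has to run before the h < 4 branch. It assumes (this is not shown in the hunk above) that both the 4-row loop at 1: and the tail loop at 2: consume v2 as the horizontal pair sum of the previous row. The names pix_abs8_xy2_ref, row_step and pix_abs8_xy2_model are made up for illustration and are not FFmpeg functions. Moving b.lt after the loads is safe because ld1 and uaddl do not modify the condition flags set by cmp.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <stdlib.h>

/* Plain reference: sum over h rows of |pix1 - rounded average of the
 * 2x2 neighbourhood in pix2| for an 8-pixel-wide block. */
static int pix_abs8_xy2_ref(const uint8_t *pix1, const uint8_t *pix2,
                            ptrdiff_t stride, int h)
{
    const uint8_t *pix3 = pix2 + stride;
    int sum = 0;
    for (int y = 0; y < h; y++) {
        for (int x = 0; x < 8; x++) {
            int avg = (pix2[x] + pix2[x + 1] + pix3[x] + pix3[x + 1] + 2) >> 2;
            sum += abs(pix1[x] - avg);
        }
        pix1 += stride;
        pix2 += stride;
        pix3 += stride;
    }
    return sum;
}

/* One row of the carried-sum scheme. On entry prev[] holds the pair sums
 * of the row above; on exit it holds the pair sums of the pix3 row. */
static int row_step(const uint8_t *pix1, const uint8_t *pix3, uint16_t prev[8])
{
    int sum = 0;
    for (int x = 0; x < 8; x++) {
        uint16_t cur = (uint16_t)(pix3[x] + pix3[x + 1]);
        int avg = (prev[x] + cur + 2) >> 2;
        sum += abs(pix1[x] - avg);
        prev[x] = cur;
    }
    return sum;
}

/* Hypothetical scalar model of the assembly's structure after the patch. */
static int pix_abs8_xy2_model(const uint8_t *pix1, const uint8_t *pix2,
                              ptrdiff_t stride, int h)
{
    const uint8_t *pix3 = pix2 + stride;
    uint16_t prev[8];
    int sum = 0;

    assert(h >= 0);

    /* Prologue: plays the role of v2. It must run before either loop is
     * chosen, otherwise the tail loop would read uninitialized sums. */
    for (int x = 0; x < 8; x++)
        prev[x] = (uint16_t)(pix2[x] + pix2[x + 1]);

    /* Main loop: four rows per iteration (label 1: in the assembly). */
    while (h >= 4) {
        for (int i = 0; i < 4; i++) {
            sum += row_step(pix1, pix3, prev);
            pix1 += stride;
            pix3 += stride;
        }
        h -= 4;
    }

    /* Tail loop: one row at a time (label 2: in the assembly). */
    while (h > 0) {
        sum += row_step(pix1, pix3, prev);
        pix1 += stride;
        pix3 += stride;
        h--;
    }
    return sum;
}
```

With the prologue placed after the branch instead, a call with h = 1, 2 or 3 would enter the tail loop with prev[] never written, which is the scalar analogue of the case this patch addresses.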