From patchwork Mon Oct 17 13:07:11 2022
Content-Type: text/plain; charset="utf-8"
MIME-Version: 1.0
Content-Transfer-Encoding: 7bit
X-Patchwork-Submitter: Hubert Mazur
X-Patchwork-Id: 34838
Received: from ip-172-31-3-164.eu-west-1.compute.internal (ec2-54-154-193-154.eu-west-1.compute.amazonaws.com.
 [54.154.193.154]) by smtp.gmail.com with ESMTPSA id t18-20020a5d6a52000000b0022af865810esm8297237wrw.75.2022.10.17.06.08.14 (version=TLS1_2 cipher=ECDHE-ECDSA-AES128-GCM-SHA256 bits=128/128); Mon, 17 Oct 2022 06:08:14 -0700 (PDT)
From: Hubert Mazur
To: ffmpeg-devel@ffmpeg.org
Date: Mon, 17 Oct 2022 13:07:11 +0000
Message-Id: <20221017130715.30896-1-hum@semihalf.com>
X-Mailer: git-send-email 2.37.1
MIME-Version: 1.0
Subject: [FFmpeg-devel] [PATCH 0/4] Provide neon implementations for hscale functions
X-BeenThere: ffmpeg-devel@ffmpeg.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: FFmpeg development discussions and patches
Reply-To: FFmpeg development discussions and patches
Cc: gjb@semihalf.com, upstream@semihalf.com, jswinney@amazon.com, Hubert Mazur, martin@martin.st, mw@semihalf.com, spop@amazon.com
Errors-To: ffmpeg-devel-bounces@ffmpeg.org
Sender: "ffmpeg-devel"
X-TUID: cu63Iemm2o/Y

Provide arm64 NEON-optimized implementations of hscale functions from the
swscale family.

Hubert Mazur (4):
  sw_scale: Add specializations for hscale 8 to 19
  tests/sw_scale: Add test cases for input sizes 16
  sw_scale: Add specializations for hscale 16 to 15
  sw_scale: Add specializations for hscale 16 to 19

 libswscale/aarch64/hscale.S  | 1101 +++++++++++++++++++++++++++++++++-
 libswscale/aarch64/swscale.c |  145 ++++-
 libswscale/swscale.c         |    3 +-
 tests/checkasm/sw_scale.c    |   35 +-
 4 files changed, 1268 insertions(+), 16 deletions(-)
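
For readers unfamiliar with the naming: "hscale 8 to 19" refers to
horizontal scaling from 8-bit input samples to a 19-bit intermediate held
in 32-bit words. The sketch below is a plain scalar illustration of that
operation, not code taken from this series. The function name
hscale_8_to_19_ref and its parameter names are hypothetical, and it assumes
filter coefficients are fixed-point values whose taps sum to roughly
1 << 14 for each output sample.

```c
#include <stddef.h>
#include <stdint.h>

/*
 * Hypothetical scalar reference for an "8 to 19" horizontal scale.
 * Not FFmpeg's implementation; names and layout are assumptions.
 *
 * dst         output samples, 19 significant bits in a 32-bit word
 * dst_w       number of output samples
 * src         8-bit input row
 * filter      dst_w * filter_size coefficients, row i belongs to output i
 * filter_pos  index of the first input sample used by output i
 * filter_size number of taps per output sample
 */
static void hscale_8_to_19_ref(int32_t *dst, int dst_w,
                               const uint8_t *src,
                               const int16_t *filter,
                               const int32_t *filter_pos,
                               int filter_size)
{
    const int32_t max19 = (1 << 19) - 1;

    for (int i = 0; i < dst_w; i++) {
        const uint8_t *s = src + filter_pos[i];
        const int16_t *f = filter + (size_t)i * filter_size;
        int32_t acc = 0;

        /* Multiply-accumulate the taps for this output sample. */
        for (int j = 0; j < filter_size; j++)
            acc += (int32_t)s[j] * f[j];

        /* 8 input bits + 14 coefficient bits - 3 = 19 bits.
         * A right shift of a negative value is implementation-defined
         * in C; common compilers shift arithmetically. */
        acc >>= 3;

        /* Clamp the upper bound only, since filter overshoot can exceed
         * the 19-bit range. */
        dst[i] = acc < max19 ? acc : max19;
    }
}
```

A NEON specialization would compute several output samples per iteration
using vector multiply-accumulate instructions, but should produce the same
values as a scalar loop of this shape.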