From patchwork Fri Oct 28 11:34:36 2022 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Hubert Mazur X-Patchwork-Id: 34846 Delivered-To: ffmpegpatchwork2@gmail.com Received: by 2002:a05:6a20:85a8:b0:a2:d5a7:ad9d with SMTP id s40csp729709pzd; Fri, 28 Oct 2022 04:35:03 -0700 (PDT) X-Google-Smtp-Source: AMsMyM69JEzy8vj2eoq86taAAj2CPBcC+nala9n1Np3fPkQOho3a1xHLCN++VQni0wHhBycGGDnd X-Received: by 2002:a17:907:7d8f:b0:78e:2cba:560f with SMTP id oz15-20020a1709077d8f00b0078e2cba560fmr7067747ejc.173.1666956903264; Fri, 28 Oct 2022 04:35:03 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1666956903; cv=none; d=google.com; s=arc-20160816; b=mLvfZW4144Mc7GC41U8//C8FwI/RaxZ5A4uszt1sz4wccMOcASx7RUC8GV9ZAkxnfy Uaz0ueMsUCNHRzU31Pt88gc0aprqA8LnDB9X1zC4mo6voUxflHaHWw0I3G7nzCA6VQ8z llHk9cFlLs+osqDo8BzQyJr0UYNSVQF5mM47OXIcsdfjJHm4sis3KMT0aX+NKRf7n7dZ Q2L2li3BsKEzghazdQNlmtrbL9LcoEvCupthlncmYSuWPlC+ReByjkyFD9kJ8aMIPjgJ VybEQ+cKEj+XML3fuFpWMgDtuoV6AIuKZw08KzbnrvoCZagMbnMDIsEqWOtHTrlF38L5 58sg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=sender:errors-to:content-transfer-encoding:cc:reply-to :list-subscribe:list-help:list-post:list-archive:list-unsubscribe :list-id:precedence:subject:mime-version:message-id:date:to:from :dkim-signature:delivered-to; bh=rIStnla41AWOCd0nOuUG5lp41QuoNXjEgVskAokT4DA=; b=XBuhFk/sSlzKM4Jet7XIakuwOeZ3tE58At+p1vyxVAC4jm09zdGYDX54SONwF6DYht /59B35omIWTkPuQ4I36Dd94J9nUsinmsJmFCp3ZTMFCFv6oABBhxC5BDj0zqMvMcPnrP Cnyiek1cMbLbw6tggiX2/iQnTxqjnyXo/Hr3Go90GHzh3AT5pfbiegejBivar97+yW0k AVrEKcsy/BBamPnPiH5Ps4m9aU8SSB4CY5MHrj6ShQNehWfGP005r2A5WiT0gKZD77bv DQz+kpz/IzEDfYrjLxtvG2+ynZ9EqYBRGs/dFo1MlNcBCJnRzsuqACwbYfrgzIK12ieV RKvg== ARC-Authentication-Results: i=1; mx.google.com; dkim=neutral (body hash did not verify) header.i=@semihalf.com header.s=google header.b=YR1xAlSc; spf=pass (google.com: domain of ffmpeg-devel-bounces@ffmpeg.org designates 79.124.17.100 as permitted sender) smtp.mailfrom=ffmpeg-devel-bounces@ffmpeg.org; dmarc=fail (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=semihalf.com Return-Path: Received: from ffbox0-bg.mplayerhq.hu (ffbox0-bg.ffmpeg.org. [79.124.17.100]) by mx.google.com with ESMTP id n19-20020aa7c453000000b00461ace746adsi4045955edr.453.2022.10.28.04.35.02; Fri, 28 Oct 2022 04:35:03 -0700 (PDT) Received-SPF: pass (google.com: domain of ffmpeg-devel-bounces@ffmpeg.org designates 79.124.17.100 as permitted sender) client-ip=79.124.17.100; Authentication-Results: mx.google.com; dkim=neutral (body hash did not verify) header.i=@semihalf.com header.s=google header.b=YR1xAlSc; spf=pass (google.com: domain of ffmpeg-devel-bounces@ffmpeg.org designates 79.124.17.100 as permitted sender) smtp.mailfrom=ffmpeg-devel-bounces@ffmpeg.org; dmarc=fail (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=semihalf.com Received: from [127.0.1.1] (localhost [127.0.0.1]) by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTP id 398DD68BCD1; Fri, 28 Oct 2022 14:34:58 +0300 (EEST) X-Original-To: ffmpeg-devel@ffmpeg.org Delivered-To: ffmpeg-devel@ffmpeg.org Received: from mail-wm1-f44.google.com (mail-wm1-f44.google.com [209.85.128.44]) by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTPS id B4D8668A42A for ; Fri, 28 Oct 2022 14:34:51 +0300 (EEST) Received: by mail-wm1-f44.google.com with SMTP id c7-20020a05600c0ac700b003c6cad86f38so6421565wmr.2 for ; Fri, 28 Oct 2022 04:34:51 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=semihalf.com; s=google; h=content-transfer-encoding:mime-version:message-id:date:subject:cc :to:from:from:to:cc:subject:date:message-id:reply-to; bh=HvXiX8HKZpBP6aQz/kKLSLB51lyFCH8OZTASUshXcmc=; b=YR1xAlScEiPHeUhjfZXYmlHZKyphX4QgSG9OOZ6xqaEQ+wdgoFJLBh4ad7x5SKnGpG VrWPnZx+m90jK2G65Dp4IxyyVtNqgPMqy8l8PxxKAzkWAStvZ0DNXsgXA2PZrnT2IC/r vPNUWu9hmwq43nS17CoPFC6ih9J0imz9ywKauNUUNbCo8+tyACmsWFScVfFUYqn2k9cs wBIJ51XraOc60zCXGTiOU6zxILclJgQGvwe2ACaW6B1Rk+orpg1rULT6sUFuSH8Ulfy1 4Qe/JC3YYRsQiYfwQrxiHhDUZz61WMfhpmzdVc5fm+LjHF+z3I1OOCTtIyia7vhlO4mz S4ww== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=content-transfer-encoding:mime-version:message-id:date:subject:cc :to:from:x-gm-message-state:from:to:cc:subject:date:message-id :reply-to; bh=HvXiX8HKZpBP6aQz/kKLSLB51lyFCH8OZTASUshXcmc=; b=Rg48rndSz3gZqfvbQ+DYqBqntmG1KQomrFeQARA7FWAkpAiIY4pHVCaeLGkyRUvYBV WGFSfg719MxdBMPTs++uie76joERKo7jAV8xB4ZKufzzunQk7mcKhEMFdaWZoSCkz+Fh f+OmwmPZOjZCEtA20WqN6SWA/DIxMFG8AJ3sl+VPidwcrtyx9kDbw/Z2hvmWuaJQjKc2 NvyiZdwwV2AJukWRa8PA9dbTxH0yBQSyPVM3URqnNZRzB/VNqL8mXAJohc2zn9/cavdb Eh3ganZzyzGGEpgDmMNmE9UToeNOBZWXvMI/jTqT+nafNA+qLA0BDzvIFe+I4dwIkzST Eo3A== X-Gm-Message-State: ACrzQf1ZBtDALp79ZJYFPCv+Gfv864WfHY49DDgPji1rKhOuigm9JklG b2JbzIEShrzviiimyNP5pZbJeLIjp5iasV5z X-Received: by 2002:a05:600c:2d86:b0:3cf:5580:c84c with SMTP id i6-20020a05600c2d8600b003cf5580c84cmr4530446wmg.146.1666956890791; Fri, 28 Oct 2022 04:34:50 -0700 (PDT) Received: from ip-172-31-3-164.eu-west-1.compute.internal (ec2-54-154-193-154.eu-west-1.compute.amazonaws.com. [54.154.193.154]) by smtp.gmail.com with ESMTPSA id m124-20020a1ca382000000b003c70191f267sm7216852wme.39.2022.10.28.04.34.49 (version=TLS1_2 cipher=ECDHE-ECDSA-AES128-GCM-SHA256 bits=128/128); Fri, 28 Oct 2022 04:34:50 -0700 (PDT) From: Hubert Mazur To: ffmpeg-devel@ffmpeg.org Date: Fri, 28 Oct 2022 11:34:36 +0000 Message-Id: <20221028113439.30279-1-hum@semihalf.com> X-Mailer: git-send-email 2.37.1 MIME-Version: 1.0 Subject: [FFmpeg-devel] [PATCH 0/3] sw_scale: Provide neon implementation for hscale X-BeenThere: ffmpeg-devel@ffmpeg.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: FFmpeg development discussions and patches List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Reply-To: FFmpeg development discussions and patches Cc: gjb@semihalf.com, upstream@semihalf.com, jswinney@amazon.com, Hubert Mazur , martin@martin.st, mw@semihalf.com, spop@amazon.com Errors-To: ffmpeg-devel-bounces@ffmpeg.org Sender: "ffmpeg-devel" X-TUID: +e4kPFIyD9Ap This patchset contains arm64 neon implementation of hscale functions. Fixed minor style issues and declared C function wrappers as static. This patchset do not contain the patch for checkasm tool, as the previous one did. The reason behind it was failing tests on x86 arch but not on aarch64 or loongarch. Probably the hscale functions on x86 have some bugs. Currently the checkasm tool does not check the validity of hscale functions on x86 at all. Implementation of hscale for x86 should be fixed anyway. As it comes to aarch64 the tests were passing. Attaching a link to fate result in patchwork. After fixing x86 the patch for checkasm could be merged. https://patchwork.ffmpeg.org/project/ffmpeg/patch/20221017130715.30896-3-hum@semihalf.com/ Hubert Mazur (3): sw_scale: Add specializations for hscale 8 to 19 sw_scale: Add specializations for hscale 16 to 15 sw_scale: Add specializations for hscale 16 to 19 libswscale/aarch64/hscale.S | 1100 ++++++++++++++++++++++++++++++++++ libswscale/aarch64/swscale.c | 140 ++++- libswscale/swscale.c | 1 - 3 files changed, 1236 insertions(+), 5 deletions(-)