From: Josh Dekker
To: ffmpeg-devel@ffmpeg.org
Date: Thu, 7 Jan 2021 13:10:16 +0100
Message-Id: <20210107121020.86179-1-josh@itanimul.li>
Subject: [FFmpeg-devel] [PATCH 0/4] AArch64 NEON for HEVC

checkasm: all 657 tests passed
hevc_add_res_4x4_8_c: 49.7
hevc_add_res_4x4_8_neon: 20.5
hevc_add_res_4x4_10_c: 45.7
hevc_add_res_4x4_10_neon: 18.7
hevc_add_res_8x8_8_c: 211.0
hevc_add_res_8x8_8_neon: 24.5
hevc_add_res_8x8_10_c: 195.7
hevc_add_res_8x8_10_neon: 24.0
hevc_add_res_16x16_8_c: 787.2
hevc_add_res_16x16_8_neon: 79.0
hevc_add_res_16x16_10_c: 714.7
hevc_add_res_16x16_10_neon: 77.7
hevc_add_res_32x32_8_c: 3444.2
hevc_add_res_32x32_8_neon: 306.5
hevc_add_res_32x32_10_c: 3820.7
hevc_add_res_32x32_10_neon: 299.5
hevc_idct_4x4_dc_8_c: 16.2
hevc_idct_4x4_dc_8_neon: 13.7
hevc_idct_4x4_dc_10_c: 16.2
hevc_idct_4x4_dc_10_neon: 14.5
hevc_idct_8x8_dc_8_c: 40.7
hevc_idct_8x8_dc_8_neon: 18.5
hevc_idct_8x8_dc_10_c: 39.2
hevc_idct_8x8_dc_10_neon: 19.2
hevc_idct_16x16_dc_8_c: 136.7
hevc_idct_16x16_dc_8_neon: 35.7
hevc_idct_16x16_dc_10_c: 136.0
hevc_idct_16x16_dc_10_neon: 36.0
hevc_idct_32x32_dc_8_c: 1386.7
hevc_idct_32x32_dc_8_neon: 132.0
hevc_idct_32x32_dc_10_c: 1366.2
hevc_idct_32x32_dc_10_neon: 132.0
hevc_sao_band_8x8_8_c: 230.7
hevc_sao_band_8x8_8_neon: 92.7

Please disregard my previous email with subject 'lavc/aarch64: add HEVC
add_residual NEON'; the patch was split incorrectly.

IDCT (first) and QPEL functions are in the works, followed by SAO edge and
whatever else is left for parity with ARM NEON.
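
As a reading aid for the checkasm numbers above, the relative speedup of each NEON function over its C reference can be computed as the ratio of the two timings. The sketch below is only an illustration: the names `TIMINGS`, `speedup` and `report` are hypothetical helpers, not part of this patch set or of checkasm, and only a handful of the reported pairs are copied in.

```python
# Timings copied from the checkasm output in this email, in the units
# checkasm reported them: (C reference, NEON). Only a subset is listed.
# TIMINGS, speedup and report are illustrative names, not FFmpeg APIs.
TIMINGS = {
    "hevc_add_res_8x8_8": (211.0, 24.5),
    "hevc_add_res_32x32_8": (3444.2, 306.5),
    "hevc_idct_4x4_dc_8": (16.2, 13.7),
    "hevc_idct_32x32_dc_8": (1386.7, 132.0),
    "hevc_sao_band_8x8_8": (230.7, 92.7),
}


def speedup(c_time, neon_time):
    """Return how many times faster the NEON version is than the C one."""
    return c_time / neon_time


def report(timings):
    """Return one formatted line per function: '<name>: <ratio>x'."""
    return [
        f"{name}: {speedup(c, neon):.2f}x"
        for name, (c, neon) in timings.items()
    ]


if __name__ == "__main__":
    print("\n".join(report(TIMINGS)))
```

The ratios are only as meaningful as the underlying measurements; small blocks such as the 4x4 DC case are dominated by call overhead, so their ratio is expected to be close to 1.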