From patchwork Tue Aug 20 13:22:32 2024
Content-Type: text/plain; charset="utf-8"
MIME-Version: 1.0
Content-Transfer-Encoding: 7bit
X-Patchwork-Submitter: Nuo Mi
X-Patchwork-Id: 35168
From: Nuo Mi
To: ffmpeg-devel@ffmpeg.org
Date: Tue, 20 Aug 2024 21:22:32 +0800
X-Mailer: git-send-email 2.34.1
X-Microsoft-Original-Message-ID: <20240820132236.286553-1-nuomi2021@gmail.com>
Subject: [FFmpeg-devel] [PATCH v2 0/4] add bdof to the vvc decoder
List-Id: FFmpeg development discussions and patches
Reply-To: FFmpeg development discussions and patches
Cc: Nuo Mi

Major changes since v1:
  remove hadd and pmulld as James suggested
  reduce vextracti128

Nuo Mi (4):
  avcodec/vvcdec: misc, rename BDOF_BLOCK_SIZE to BDOF_MIN_BLOCK_SIZE
  avcodec/vvcdec: bdof, do not pad sources and gradients to simplify the
    code
  x86/vvcdec: inter, add optical flow avx2 code
  checkasm: add vvc_bdof test

 libavcodec/vvc/dsp.c             |  27 +--
 libavcodec/vvc/dsp.h             |   4 +-
 libavcodec/vvc/inter_template.c  |  75 +++---
 libavcodec/x86/vvc/Makefile      |   1 +
 libavcodec/x86/vvc/vvc_of.asm    | 385 +++++++++++++++++++++++++++++++
 libavcodec/x86/vvc/vvcdsp_init.c |  21 ++
 tests/checkasm/vvc_mc.c          |  50 ++++
 7 files changed, 494 insertions(+), 69 deletions(-)
 create mode 100644 libavcodec/x86/vvc/vvc_of.asm

---
2.34.1