From patchwork Mon Nov 21 12:44:06 2022
Content-Type: text/plain; charset="utf-8"
MIME-Version: 1.0
Content-Transfer-Encoding: 7bit
X-Patchwork-Submitter: James Darnley
X-Patchwork-Id: 39374
From: James Darnley
To: ffmpeg-devel@ffmpeg.org
Date: Mon, 21 Nov 2022 13:44:06 +0100
Message-Id: <20221121124408.1577897-1-jdarnley@obe.tv>
X-Mailer: git-send-email 2.38.0
Subject: [FFmpeg-devel] [PATCH 1/3] checkasm/v210enc: test the entire width of 10-bit planar input arrays

---
 tests/checkasm/v210enc.c | 6 ++++--
 1 file changed, 4 insertions(+), 2 deletions(-)

diff --git a/tests/checkasm/v210enc.c b/tests/checkasm/v210enc.c
index 9942e08137..9fb8321c25 100644
--- a/tests/checkasm/v210enc.c
+++ b/tests/checkasm/v210enc.c
@@ -72,8 +72,10 @@
                 randomize_buffers(mask); \
                 call_ref(y0 + y_offset, u0 + uv_offset, v0 + uv_offset, dst0, width); \
                 call_new(y1 + y_offset, u1 + uv_offset, v1 + uv_offset, dst1, width); \
-                if (memcmp(y0, y1, BUF_SIZE) || memcmp(u0, u1, BUF_SIZE / 2) || \
-                    memcmp(v0, v1, BUF_SIZE / 2) || memcmp(dst0, dst1, width * 8 / 3)) \
+                if (memcmp(y0, y1, BUF_SIZE * sizeof(type)) \
+                    || memcmp(u0, u1, BUF_SIZE * sizeof(type) / 2) \
+                    || memcmp(v0, v1, BUF_SIZE * sizeof(type) / 2) \
+                    || memcmp(dst0, dst1, width * 8 / 3)) \
                     fail(); \
                 bench_new(y1 + y_offset, u1 + uv_offset, v1 + uv_offset, dst1, width); \
             } \

From patchwork Mon Nov 21 12:44:07 2022
Content-Type: text/plain; charset="utf-8"
MIME-Version: 1.0
Content-Transfer-Encoding: 7bit
X-Patchwork-Submitter: James Darnley
X-Patchwork-Id: 39375
From: James Darnley
To: ffmpeg-devel@ffmpeg.org
Date: Mon, 21 Nov 2022 13:44:07 +0100
Message-Id: <20221121124408.1577897-2-jdarnley@obe.tv>
X-Mailer: git-send-email 2.38.0
In-Reply-To: <20221121124408.1577897-1-jdarnley@obe.tv>
References: <20221121124408.1577897-1-jdarnley@obe.tv>
Subject: [FFmpeg-devel] [PATCH 2/3] avcodec/x86/v210: replace register use with named register

---
 libavcodec/x86/v210enc.asm | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/libavcodec/x86/v210enc.asm b/libavcodec/x86/v210enc.asm
index afac238ede..c2ad3d72c0 100644
--- a/libavcodec/x86/v210enc.asm
+++ b/libavcodec/x86/v210enc.asm
@@ -62,7 +62,7 @@ SECTION .text
 
 ; v210_planar_pack_10(const uint16_t *y, const uint16_t *u, const uint16_t *v, uint8_t *dst, ptrdiff_t width)
 cglobal v210_planar_pack_10, 5, 5, 4+cpuflag(avx2), y, u, v, dst, width
-    lea r0, [yq+2*widthq]
+    lea yq, [yq+2*widthq]
     add uq, widthq
     add vq, widthq
     neg widthq

From patchwork Mon Nov 21 12:44:08 2022
Content-Type: text/plain; charset="utf-8"
MIME-Version: 1.0
Content-Transfer-Encoding: 8bit
X-Patchwork-Submitter: James Darnley
X-Patchwork-Id: 39376
From: James Darnley
To: ffmpeg-devel@ffmpeg.org
Date: Mon, 21 Nov 2022 13:44:08 +0100
Message-Id: <20221121124408.1577897-3-jdarnley@obe.tv>
X-Mailer: git-send-email 2.38.0
In-Reply-To: <20221121124408.1577897-1-jdarnley@obe.tv>
References: <20221121124408.1577897-1-jdarnley@obe.tv>
Subject: [FFmpeg-devel] [PATCH 3/3] avcodec/v210enc: add new 10-bit function for avx512 avx512icl

avx512 on Skylake-X (Xeon D-2123IT): 1.19x faster (970±91.2 vs. 817±104.4 decicycles) compared with avx2
avx512icl on Ice Lake (Xeon Silver 4316): 2.52x faster (1350±5.3 vs. 535±9.5 decicycles) compared with avx2
---
 libavcodec/x86/v210enc.asm    | 99 +++++++++++++++++++++++++++++++++++
 libavcodec/x86/v210enc_init.c | 12 +++++
 2 files changed, 111 insertions(+)

diff --git a/libavcodec/x86/v210enc.asm b/libavcodec/x86/v210enc.asm
index c2ad3d72c0..9cee954619 100644
--- a/libavcodec/x86/v210enc.asm
+++ b/libavcodec/x86/v210enc.asm
@@ -56,6 +56,36 @@ v210enc_8_permd: dd 0,1,4,5, 1,2,5,6
 v210enc_8_mult: db 4, 0, 64, 0
 v210enc_8_mask: dd 255<<12
 
+icl_perm_y: ; vpermb does not set bytes to zero when the high bit is set unlike pshufb
+%assign i 0
+%rep 8
+    db -1,i+0,i+1,-1 , i+2,i+3,i+4,i+5
+    %assign i i+6
+%endrep
+
+icl_perm_uv: ; vpermb does not set bytes to zero when the high bit is set unlike pshufb
+%assign i 0
+%rep 4
+    db i+0,i+1,i+32,i+33 , -1,i+2,i+3,-1 , i+34,i+35,i+4,i+5 , -1,i+36,i+37,-1
+    %assign i i+6
+%endrep
+
+icl_perm_y_kmask: times 8 db 0b1111_0110
+icl_perm_uv_kmask: times 8 db 0b0110_1111
+
+icl_shift_y: times 10 dw 2,0,4
+             times 4 db 0 ; padding to 64 bytes
+icl_shift_uv: times 5 dw 0,2,4
+              times 2 db 0 ; padding to 32 bytes
+              times 5 dw 4,0,2
+              times 2 db 0 ; padding to 32 bytes
+
+v210enc_10_permd_y: dd 0,1,2,-1 , 3,4,5,-1
+v210enc_10_shufb_y: db -1,0,1,-1 , 2,3,4,5 , -1,6,7,-1 , 8,9,10,11
+v210enc_10_permd_uv: dd 0,1,4,5 , 1,2,5,6
+v210enc_10_shufb_uv: db 0,1, 8, 9 , -1,2,3,-1 , 10,11,4,5 , -1,12,13,-1
+                     db 2,3,10,11 , -1,4,5,-1 , 12,13,6,7 , -1,14,15,-1
+
 SECTION .text
 
 %macro v210_planar_pack_10 0
@@ -113,6 +143,75 @@ INIT_YMM avx2
 v210_planar_pack_10
 %endif
 
+%macro v210_planar_pack_10_new 0
+
+cglobal v210_planar_pack_10, 5, 5, 8+2*notcpuflag(avx512icl), y, u, v, dst, width
+    lea yq, [yq+2*widthq]
+    add uq, widthq
+    add vq, widthq
+    neg widthq
+
+    %if cpuflag(avx512icl)
+        movu m6, [icl_perm_y]
+        movu m7, [icl_perm_uv]
+        kmovq k1, [icl_perm_y_kmask]
+        kmovq k2, [icl_perm_uv_kmask]
+    %else
+        movu m6, [v210enc_10_permd_y]
+        VBROADCASTI128 m7, [v210enc_10_shufb_y]
+        movu m8, [v210enc_10_permd_uv]
+        movu m9, [v210enc_10_shufb_uv]
+    %endif
+    movu m2, [icl_shift_y]
+    movu m3, [icl_shift_uv]
+    VBROADCASTI128 m4, [v210_enc_min_10] ; only ymm sized
+    VBROADCASTI128 m5, [v210_enc_max_10] ; only ymm sized
+
+    .loop:
+        movu m0, [yq + widthq*2]
+        %if cpuflag(avx512icl)
+            movu ym1, [uq + widthq*1]
+            vinserti32x8 zm1, [vq + widthq*1], 1
+        %else
+            movu xm1, [uq + widthq*1]
+            vinserti128 ym1, [vq + widthq*1], 1
+        %endif
+        CLIPW m0, m4, m5
+        CLIPW m1, m4, m5
+
+        vpsllvw m0, m2
+        vpsllvw m1, m3
+        %if cpuflag(avx512icl)
+            vpermb m0{k1}{z}, m6, m0
+            vpermb m1{k2}{z}, m7, m1
+        %else
+            vpermd m0, m6, m0
+            pshufb m0, m7
+            vpermd m1, m8, m1
+            pshufb m1, m9
+        %endif
+        por m0, m1
+
+        movu [dstq], m0
+        add dstq, mmsize
+        add widthq, (mmsize*3)/8
+    jl .loop
+RET
+
+%endmacro
+
+%if ARCH_X86_64
+%if HAVE_AVX512_EXTERNAL
+INIT_YMM avx512
+v210_planar_pack_10_new
+%endif
+%endif
+
+%if HAVE_AVX512ICL_EXTERNAL
+INIT_ZMM avx512icl
+v210_planar_pack_10_new
+%endif
+
 %macro v210_planar_pack_8 0
 
 ; v210_planar_pack_8(const uint8_t *y, const uint8_t *u, const uint8_t *v, uint8_t *dst, ptrdiff_t width)
diff --git a/libavcodec/x86/v210enc_init.c b/libavcodec/x86/v210enc_init.c
index 6e9f8c6e61..5d1ebcb893 100644
--- a/libavcodec/x86/v210enc_init.c
+++ b/libavcodec/x86/v210enc_init.c
@@ -37,6 +37,12 @@ void ff_v210_planar_pack_10_ssse3(const uint16_t *y, const uint16_t *u,
 void ff_v210_planar_pack_10_avx2(const uint16_t *y, const uint16_t *u,
                                  const uint16_t *v, uint8_t *dst,
                                  ptrdiff_t width);
+void ff_v210_planar_pack_10_avx512(const uint16_t *y, const uint16_t *u,
+                                   const uint16_t *v, uint8_t *dst,
+                                   ptrdiff_t width);
+void ff_v210_planar_pack_10_avx512icl(const uint16_t *y, const uint16_t *u,
+                                      const uint16_t *v, uint8_t *dst,
+                                      ptrdiff_t width);
 
 av_cold void ff_v210enc_init_x86(V210EncContext *s)
 {
@@ -60,10 +66,16 @@ av_cold void ff_v210enc_init_x86(V210EncContext *s)
     if (EXTERNAL_AVX512(cpu_flags)) {
         s->sample_factor_8 = 2;
         s->pack_line_8 = ff_v210_planar_pack_8_avx512;
+#if ARCH_X86_64
+        s->sample_factor_10 = 2;
+        s->pack_line_10 = ff_v210_planar_pack_10_avx512;
+#endif
     }
 
     if (EXTERNAL_AVX512ICL(cpu_flags)) {
         s->sample_factor_8 = 4;
         s->pack_line_8 = ff_v210_planar_pack_8_avx512icl;
+        s->sample_factor_10 = 4;
+        s->pack_line_10 = ff_v210_planar_pack_10_avx512icl;
     }
 }

From patchwork Fri Nov 25 15:17:19 2022
Content-Type: text/plain; charset="utf-8"
MIME-Version: 1.0
Content-Transfer-Encoding: 7bit
X-Patchwork-Submitter: James Darnley
X-Patchwork-Id: 39444
From: James Darnley
To: ffmpeg-devel@ffmpeg.org
Date: Fri, 25 Nov 2022 16:17:19 +0100
Message-Id: <20221125151720.1655051-4-jdarnley@obe.tv>
X-Mailer: git-send-email 2.38.0
In-Reply-To: <20221121124408.1577897-1-jdarnley@obe.tv>
References: <20221121124408.1577897-1-jdarnley@obe.tv>
Subject: [FFmpeg-devel] [PATCH v2 4/5] avcodec/x86/v210enc: expand and correct comments

---
 libavcodec/x86/v210enc.asm | 8 ++++----
 1 file changed, 4 insertions(+), 4 deletions(-)

diff --git a/libavcodec/x86/v210enc.asm b/libavcodec/x86/v210enc.asm
index 552164a8be..d3639cd440 100644
--- a/libavcodec/x86/v210enc.asm
+++ b/libavcodec/x86/v210enc.asm
@@ -314,7 +314,7 @@ cglobal v210_planar_pack_8, 5, 5, 7+notcpuflag(avx512icl), y, u, v, dst, width
         movu ym1, [yq + 2*widthq]
         vinserti32x4 m1, [uq + 1*widthq], 2
         vinserti32x4 m1, [vq + 1*widthq], 3
-        vpermb m1, m2, m1 ; uyv0 yuy0 vyu0 yvy0
+        vpermb m1, m2, m1 ; uyvx yuyx vyux yvyx
     %else
         movq xm0, [uq + 1*widthq] ; uuuu uuxx
         movq xm1, [vq + 1*widthq] ; vvvv vvxx
@@ -325,10 +325,10 @@ cglobal v210_planar_pack_8, 5, 5, 7+notcpuflag(avx512icl), y, u, v, dst, width
     %endif
     CLIPUB m1, m4, m5
 
-    pmaddubsw m0, m1, m3
-    pslld m1, 4
+    pmaddubsw m0, m1, m3 ; shift high and low samples of each dword and mask out other bits
+    pslld m1, 4 ; shift center sample of each dword
     %if cpuflag(avx512)
-        vpternlogd m0, m1, m6, 0xd8 ; C?B:A
+        vpternlogd m0, m1, m6, 0xd8 ; C?B:A ; merge and mask out bad bits from B
     %else
         pand m1, m6, m1
         pandn m0, m6, m0

From patchwork Fri Nov 25 15:17:20 2022
Content-Type: text/plain; charset="utf-8"
MIME-Version: 1.0
Content-Transfer-Encoding: 7bit
X-Patchwork-Submitter: James Darnley
X-Patchwork-Id: 39445
 header.b=a+JyRaFg; spf=pass (google.com: domain of ffmpeg-devel-bounces@ffmpeg.org designates 79.124.17.100 as permitted sender) smtp.mailfrom=ffmpeg-devel-bounces@ffmpeg.org
Return-Path:
Received: from ffbox0-bg.mplayerhq.hu (ffbox0-bg.ffmpeg.org. [79.124.17.100]) by mx.google.com with ESMTP id y12-20020a056402358c00b0046aadad5521si2899818edc.449.2022.11.25.07.20.16; Fri, 25 Nov 2022 07:20:16 -0800 (PST)
Received-SPF: pass (google.com: domain of ffmpeg-devel-bounces@ffmpeg.org designates 79.124.17.100 as permitted sender) client-ip=79.124.17.100;
Authentication-Results: mx.google.com; dkim=neutral (body hash did not verify) header.i=@obe-tv.20210112.gappssmtp.com header.s=20210112 header.b=a+JyRaFg; spf=pass (google.com: domain of ffmpeg-devel-bounces@ffmpeg.org designates 79.124.17.100 as permitted sender) smtp.mailfrom=ffmpeg-devel-bounces@ffmpeg.org
Received: from [127.0.1.1] (localhost [127.0.0.1]) by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTP id 73FBD68BD16; Fri, 25 Nov 2022 17:19:39 +0200 (EET)
X-Original-To: ffmpeg-devel@ffmpeg.org
Delivered-To: ffmpeg-devel@ffmpeg.org
Received: from mail-wm1-f52.google.com (mail-wm1-f52.google.com [209.85.128.52]) by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTPS id CF96968BD00 for ; Fri, 25 Nov 2022 17:19:31 +0200 (EET)
Received: by mail-wm1-f52.google.com with SMTP id o7-20020a05600c510700b003cffc0b3374so3657220wms.0 for ; Fri, 25 Nov 2022 07:19:31 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=obe-tv.20210112.gappssmtp.com; s=20210112; h=content-transfer-encoding:mime-version:references:in-reply-to :message-id:date:subject:to:from:from:to:cc:subject:date:message-id :reply-to; bh=dOTC1r4qyO4EQ+mVvM+Sl4kh8dNLi7bF6wMU1ym47vk=; b=a+JyRaFgukON0kUQ/Jg4Yoik8qtG24PaNtYudVJEs61HQc1Zqi4tslygglb4BWuBxB CoHbsFam/sw1LaaDAsgYL/w9f6mHIWz0H/R8qIEryXDpcbHZ8gY1LrMJCZ6gXJUy11jL a53md5QAlrTZahGJnhC3f/O/WXQ5+ylcaRjl1JmR602ulI3T4odnJ21EfLF5AEyC2oNd 1LHYkIBvmA+6Wy8T8rZ4xXb7Fp3RqaIPEdAMx0I8jM1TUgjlb10dvzs/zKrO29M30Epa 8aMPO075isX4Tt6jJywmsRP2idTuGZPR1HuIYrQgVwiFX4EZMEHB2w6NZ0/4KxZBcWc8 BOfw==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=content-transfer-encoding:mime-version:references:in-reply-to :message-id:date:subject:to:from:x-gm-message-state:from:to:cc :subject:date:message-id:reply-to; bh=dOTC1r4qyO4EQ+mVvM+Sl4kh8dNLi7bF6wMU1ym47vk=; b=LtKxXvB0RBzecWCB1tAV1dOl3mBvCE+1ft/h/bltClza96w4Uxk/zREhEhbPkgLeRL lcXcu2PoMzJA6mj3SoHXrNhXvUPQC69Ff53E1eT5WUNa7RRd/yPhShU7GB7ixG3t8Otk obB2BmDliJ1iAiUhe31aELmic52O/8m2zjpizDzycbVUwNQKwZhwBc6kHWAemL3soVUw kxW84wGPiYSnZFuprIUCLiM25hk4gxEvOOKbRUaAw0veVyeX7p4XPAfjsq2aX6QUr4vh +3Y1MzvjGJLWtcP77p30AQnwQQf0Rz6zvffLWhOtnHPFeDXII0rl8PVGKY9FTCEEEm2c TTFg==
X-Gm-Message-State: ANoB5pkJELo2DGojQOlSjXoC/NGPkwbiUyLQuDta1wPimkJvXnnSg5Sz KlYwck2iZQ/WjgLZY0yPniT+qZiyaySBSg==
X-Received: by 2002:a1c:7709:0:b0:3cf:6a83:c7a3 with SMTP id t9-20020a1c7709000000b003cf6a83c7a3mr26753006wmi.21.1669389571254; Fri, 25 Nov 2022 07:19:31 -0800 (PST)
Received: from Dana.systemlords.lan (d51A44418.access.telenet.be.
 [81.164.68.24]) by smtp.gmail.com with ESMTPSA id z4-20020a05600c0a0400b003c70191f267sm10794254wmp.39.2022.11.25.07.19.30 for (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Fri, 25 Nov 2022 07:19:30 -0800 (PST)
From: James Darnley
To: ffmpeg-devel@ffmpeg.org
Date: Fri, 25 Nov 2022 16:17:20 +0100
Message-Id: <20221125151720.1655051-5-jdarnley@obe.tv>
X-Mailer: git-send-email 2.38.0
In-Reply-To: <20221121124408.1577897-1-jdarnley@obe.tv>
References: <20221121124408.1577897-1-jdarnley@obe.tv>
MIME-Version: 1.0
Subject: [FFmpeg-devel] [PATCH v2 5/5] avcodec/x86/v210enc: remove unneeded instruction
X-BeenThere: ffmpeg-devel@ffmpeg.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: FFmpeg development discussions and patches
List-Unsubscribe: ,
List-Archive:
List-Post:
List-Help:
List-Subscribe: ,
Reply-To: FFmpeg development discussions and patches
Errors-To: ffmpeg-devel-bounces@ffmpeg.org
Sender: "ffmpeg-devel"
X-TUID: 4VfszryW1rqQ

---
 libavcodec/x86/v210enc.asm | 1 -
 1 file changed, 1 deletion(-)

diff --git a/libavcodec/x86/v210enc.asm b/libavcodec/x86/v210enc.asm
index d3639cd440..daf5f2ab81 100644
--- a/libavcodec/x86/v210enc.asm
+++ b/libavcodec/x86/v210enc.asm
@@ -331,7 +331,6 @@ cglobal v210_planar_pack_8, 5, 5, 7+notcpuflag(avx512icl), y, u, v, dst, width
     vpternlogd m0, m1, m6, 0xd8 ; C?B:A ; merge and mask out bad bits from B
 %else
     pand m1, m6, m1
-    pandn m0, m6, m0
     por m0, m0, m1
 %endif
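
Note for readers of the series: the `0xd8` immediate and the dropped `pandn` can both be checked with a small scalar model. The sketch below is Python, not the project's assembly, and is only an illustration. The helper names, the mask value `0x000FFC00` standing in for `m6`, the sample bytes, and the shape assumed for the `pmaddubsw` result are all assumptions; the actual constants in `m3` and `m6` are not visible in these hunks. It builds the ternary-logic immediate for "C?B:A" from its truth table, then compares the AVX-512 merge with the fallback path before and after patch 5/5.

```python
MASK32 = 0xFFFFFFFF


def ternlog_imm(f):
    """Return the 8-bit vpternlog immediate for a one-bit function f(a, b, c).

    Bit number (a << 2 | b << 1 | c) of the immediate holds f(a, b, c),
    with a taken from the destination operand, b from the second operand
    and c from the third.
    """
    imm = 0
    for idx in range(8):
        a, b, c = (idx >> 2) & 1, (idx >> 1) & 1, idx & 1
        imm |= (f(a, b, c) & 1) << idx
    return imm


def ternlog32(a, b, c, imm):
    """Scalar model of vpternlogd on a single 32-bit lane."""
    out = 0
    for bit in range(32):
        idx = (((a >> bit) & 1) << 2) | (((b >> bit) & 1) << 1) | ((c >> bit) & 1)
        out |= ((imm >> idx) & 1) << bit
    return out


def merge_pand_pandn_por(a, b, c):
    """Fallback path before patch 5/5: (c & b) | (~c & a)."""
    return ((c & b) | (~c & a)) & MASK32


def merge_pand_por(a, b, c):
    """Fallback path after patch 5/5: (c & b) | a."""
    return ((c & b) | a) & MASK32


SELECT_IMM = ternlog_imm(lambda a, b, c: b if c else a)  # "C?B:A"

# Hypothetical input for one dword, laid out as [s0, s1, s2, junk] bytes.
s0, s1, s2, junk = 0x12, 0x34, 0x56, 0xFF
packed = s0 | (s1 << 8) | (s2 << 16) | (junk << 24)

C = 0x000FFC00               # assumed mask: bits 10..19, the center sample
A = (s0 << 2) | (s2 << 22)   # assumed pmaddubsw result: center field already zero
B = (packed << 4) & MASK32   # pslld by 4: center sample lands at bits 12..19

expected = (s0 << 2) | (s1 << 12) | (s2 << 22)

print(hex(SELECT_IMM))       # 0xd8
```

With these inputs `ternlog32(A, B, C, SELECT_IMM)`, `merge_pand_pandn_por(A, B, C)` and `merge_pand_por(A, B, C)` all return `expected`. The two fallback variants agree only because `A` has no bits set under the mask, which is what the patch 4/5 comment on `pmaddubsw` describes; if `A` carried stray bits in the center field, dropping the `pandn` would change the result.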