From patchwork Thu Jan 18 16:33:41 2024 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Lynne X-Patchwork-Id: 45655 Delivered-To: ffmpegpatchwork2@gmail.com Received: by 2002:a05:6a20:968f:b0:199:de12:6fa6 with SMTP id hp15csp382826pzc; Thu, 18 Jan 2024 08:34:00 -0800 (PST) X-Google-Smtp-Source: AGHT+IGWfS6sk5S/lNRPYzBA89+G9JvSOe9Z/8TszI6HLqC7tDSLidkh/C8bsUSqMxynoe8YxACK X-Received: by 2002:a17:906:4547:b0:a2d:897f:5a0b with SMTP id s7-20020a170906454700b00a2d897f5a0bmr1444692ejq.5.1705595639642; Thu, 18 Jan 2024 08:33:59 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1705595639; cv=none; d=google.com; s=arc-20160816; b=A183Qd3cadm49eZqkpEq9oR0LJ1LPnqS1c71ej/y48osw0GKxWPGV5WmX6lvxE4I+D zjgI4Owjmz99MY5etoFWpcfRoW2BI5hj5jPivEtq9Eg6BAi/zBl2e1JodM9vc6v2lJqL /0tCjR03PqHnCJzkgSfQh9czzOSDnDvemSYRfKVbjvLsgit/7mfRs4O5WVdX+/wauPz+ KH9w20s7pxnYP2h+PJxtuRCQna3E+nrzgZIRNK3N3ap7ssw79Il8Nwv0bpnEdRHceWq3 ZrBduSA5LUGjiUHRzNyhL3gghLRmNLg2b1w5+jFdDnk8ROUff0tG9V+/L9o0Ueo3kgY5 MfOg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=sender:errors-to:reply-to:list-subscribe:list-help:list-post :list-archive:list-unsubscribe:list-id:precedence:subject :mime-version:message-id:to:from:date:dkim-signature:delivered-to; bh=y+mOo5nO9igsE9/OooJdXP8oZeLoN0uvFe5xcd7Wc/A=; fh=Q46kXK7oI5D1Jhi90JBr53c7NIaTxGaU4KPeRZyM/hI=; b=DLlDwh6AuCBUbDpWi0W+A9PV3SIznlS2AKIRKbCAbIx6qUxdLgw8krmW55N4b//h6o fzY+HGTK0k7XQ43NI7nFyDlgPEo1zPYgDJcSFKitaodiLxmamNDQczgf2GtoegsJ3zP1 O2GHh8djr/+TKi7xiEl8GR9+eJfSINSIRfivP6KCP1rHzjpN4JXB1zBOP+CquWNyDuQ1 C8NXJDaqcbKjpahzCZG+jIq+Gbqnd6/SgE32yL19/WFq/lkMfG5Ff0WePvDVMOZJrA5r NK4sAZ2yG3/T1DdFIIs1LC7NQHj9PIsYsrSS7GDcDU3wV8DfWu1WB7pIHg940ax9MB9J gkig== ARC-Authentication-Results: i=1; mx.google.com; dkim=neutral (body hash did not verify) header.i=@lynne.ee header.s=s1 header.b=cRY7Jfr1; spf=pass (google.com: domain of ffmpeg-devel-bounces@ffmpeg.org designates 79.124.17.100 as permitted sender) smtp.mailfrom=ffmpeg-devel-bounces@ffmpeg.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=lynne.ee Return-Path: Received: from ffbox0-bg.mplayerhq.hu (ffbox0-bg.ffmpeg.org. [79.124.17.100]) by mx.google.com with ESMTP id f16-20020a170906495000b00a293dbc805fsi6778382ejt.422.2024.01.18.08.33.53; Thu, 18 Jan 2024 08:33:59 -0800 (PST) Received-SPF: pass (google.com: domain of ffmpeg-devel-bounces@ffmpeg.org designates 79.124.17.100 as permitted sender) client-ip=79.124.17.100; Authentication-Results: mx.google.com; dkim=neutral (body hash did not verify) header.i=@lynne.ee header.s=s1 header.b=cRY7Jfr1; spf=pass (google.com: domain of ffmpeg-devel-bounces@ffmpeg.org designates 79.124.17.100 as permitted sender) smtp.mailfrom=ffmpeg-devel-bounces@ffmpeg.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=lynne.ee Received: from [127.0.1.1] (localhost [127.0.0.1]) by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTP id 5D3A3680276; Thu, 18 Jan 2024 18:33:49 +0200 (EET) X-Original-To: ffmpeg-devel@ffmpeg.org Delivered-To: ffmpeg-devel@ffmpeg.org Received: from w4.tutanota.de (w4.tutanota.de [81.3.6.165]) by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTPS id D064E68CD64 for ; Thu, 18 Jan 2024 18:33:42 +0200 (EET) Received: from tutadb.w10.tutanota.de (unknown [192.168.1.10]) by w4.tutanota.de (Postfix) with ESMTP id 1BF9510602DA for ; Thu, 18 Jan 2024 16:33:42 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; t=1705595621; s=s1; d=lynne.ee; h=From:From:To:To:Subject:Subject:Content-Description:Content-ID:Content-Type:Content-Type:Content-Transfer-Encoding:Cc:Date:Date:In-Reply-To:MIME-Version:MIME-Version:Message-ID:Message-ID:Reply-To:References:Sender; bh=vlE/hglZmTdZtC/r0iJqcRk51GVwpnan+PcHb4Tbqfc=; b=cRY7Jfr1LraKp/9igw49yzYwJ4VrMHKoqJkTVFxhs25caBSNV1etcXJerZ+VCObU J9SHOmmqLYk3WLE3I7P8mxzOiDn2Zq/z7Za0pOSOyV30hBmGcS+2r4UEux5kuc6XYpZ a8UWt1PvK3fd0/PhS/B/s/XENZ7k0w8kV7ICJ13nvuBcsOhB+/jg4iRQjucXfe2ileo RFXtgKI3nc7ZM5WFC/bZOFpV/Yh4iqIuOiXndIu31sETfVoMGohsSovWUR4ifH8fLj+ ZGdOxe1oWjyuZFGRdXU8HGfvicbooKZt/EaEztHT8eRnbmhjfRrOgxACbRFlS6hvS7S xLOAc2hljA== Date: Thu, 18 Jan 2024 17:33:41 +0100 (CET) From: Lynne To: Ffmpeg Devel Message-ID: MIME-Version: 1.0 Subject: [FFmpeg-devel] [PATCH] x86/tx_float: enable SIMD for sizes over 131072 X-BeenThere: ffmpeg-devel@ffmpeg.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: FFmpeg development discussions and patches List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Reply-To: FFmpeg development discussions and patches Errors-To: ffmpeg-devel-bounces@ffmpeg.org Sender: "ffmpeg-devel" X-TUID: p8GDkilI3s/Q The tables for the new sizes were added last year due to being required for SDR. However, the assembly was never updated to use them. Patch attached. From ccfd9366025105a7dba0471965856b12d73bbd17 Mon Sep 17 00:00:00 2001 From: Lynne Date: Thu, 18 Jan 2024 17:30:29 +0100 Subject: [PATCH] x86/tx_float: enable SIMD for sizes over 131072 The tables for the new sizes were added last year due to being required for SDR. However, the assembly was never updated to use them. --- libavutil/x86/tx_float.asm | 8 ++++++-- 1 file changed, 6 insertions(+), 2 deletions(-) diff --git a/libavutil/x86/tx_float.asm b/libavutil/x86/tx_float.asm index e1533a8595..42006848f1 100644 --- a/libavutil/x86/tx_float.asm +++ b/libavutil/x86/tx_float.asm @@ -46,7 +46,7 @@ %endif %assign i 16 -%rep 14 +%rep 18 cextern tab_ %+ i %+ _float ; ff_tab_i_float... %assign i (i << 1) %endrep @@ -1385,7 +1385,11 @@ FFT_SPLIT_RADIX_DEF 8192, .16384pt FFT_SPLIT_RADIX_DEF 16384, .32768pt FFT_SPLIT_RADIX_DEF 32768, .65536pt FFT_SPLIT_RADIX_DEF 65536, .131072pt -FFT_SPLIT_RADIX_DEF 131072 +FFT_SPLIT_RADIX_DEF 131072, .262144pt +FFT_SPLIT_RADIX_DEF 262144, .524288pt +FFT_SPLIT_RADIX_DEF 524288, .1048576pt +FFT_SPLIT_RADIX_DEF 1048576, .2097152pt +FFT_SPLIT_RADIX_DEF 2097152 ;=============================================================================== ; Final synthesis + deinterleaving code -- 2.43.0