[FFmpeg-devel] Moves yuv2yuvX_sse3 to yasm, unrolls main loop and other small optimizations for ~20% speedup. AVX2 version is ready and tested, although local tests show a significant speed-up in this function using avx2, swscale code slows down overall p

Date: Thu, 22 Oct 2020 09:43:53 +0200
In-Reply-To: <20200915161158.2051301-1-alankelly@google.com>
Message-Id: <20201022074353.2333866-1-alankelly@google.com>
Mime-Version: 1.0
References: <20200915161158.2051301-1-alankelly@google.com>
From: Alan Kelly <alankelly-at-google.com@ffmpeg.org>
To: ffmpeg-devel@ffmpeg.org
Subject: [FFmpeg-devel] [PATCH] Moves yuv2yuvX_sse3 to yasm,
 unrolls main loop and other small optimizations for ~20% speedup.
 AVX2 version is ready and tested,
 although local tests show a significant speed-up in this function using avx2,
 swscale code slows down overall probably due cpu frequency scaling.
Precedence: list
Reply-To: FFmpeg development discussions and patches <ffmpeg-devel@ffmpeg.org>
Cc: Alan Kelly <alankelly@google.com>
Content-Type: text/plain; charset="utf-8"
Content-Transfer-Encoding: base64
Errors-To: ffmpeg-devel-bounces@ffmpeg.org
Sender: "ffmpeg-devel" <ffmpeg-devel-bounces@ffmpeg.org>

Message ID	20201022074353.2333866-1-alankelly@google.com
State	Superseded
Headers	show Return-Path: <ffmpeg-devel-bounces@ffmpeg.org> Date: Thu, 22 Oct 2020 09:43:53 +0200 In-Reply-To: <20200915161158.2051301-1-alankelly@google.com> Message-Id: <20201022074353.2333866-1-alankelly@google.com> Mime-Version: 1.0 References: <20200915161158.2051301-1-alankelly@google.com> From: Alan Kelly <alankelly-at-google.com@ffmpeg.org> To: ffmpeg-devel@ffmpeg.org Subject: [FFmpeg-devel] [PATCH] Moves yuv2yuvX_sse3 to yasm, unrolls main loop and other small optimizations for ~20% speedup. AVX2 version is ready and tested, although local tests show a significant speed-up in this function using avx2, swscale code slows down overall probably due cpu frequency scaling. Precedence: list Reply-To: FFmpeg development discussions and patches <ffmpeg-devel@ffmpeg.org> Cc: Alan Kelly <alankelly@google.com> Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: base64 Errors-To: ffmpeg-devel-bounces@ffmpeg.org Sender: "ffmpeg-devel" <ffmpeg-devel-bounces@ffmpeg.org>
Series	[FFmpeg-devel] Moves yuv2yuvX_sse3 to yasm, unrolls main loop and other small optimizations for ~20% speedup. AVX2 version is ready and tested, although local tests show a significant speed-up in this function using avx2, swscale code slows down overall p \| expand [FFmpeg-devel] Moves yuv2yuvX_sse3 to yasm, unrolls main loop and other small optimizations for ~20…

Context	Check	Description
andriy/x86_make	success	Make finished
andriy/x86_make_fate	success	Make fate finished
andriy/PPC64_make	success	Make finished
andriy/PPC64_make_fate	success	Make fate finished

[FFmpeg-devel] Moves yuv2yuvX_sse3 to yasm, unrolls main loop and other small optimizations for ~20% speedup. AVX2 version is ready and tested, although local tests show a significant speed-up in this function using avx2, swscale code slows down overall p

Checks

Commit Message

Comments

Patch