From patchwork Thu Jan 3 00:43:54 2019 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: James Almer X-Patchwork-Id: 11637 Return-Path: X-Original-To: patchwork@ffaux-bg.ffmpeg.org Delivered-To: patchwork@ffaux-bg.ffmpeg.org Received: from ffbox0-bg.mplayerhq.hu (ffbox0-bg.ffmpeg.org [79.124.17.100]) by ffaux.localdomain (Postfix) with ESMTP id 82FDA44A0F6 for ; Thu, 3 Jan 2019 02:44:12 +0200 (EET) Received: from [127.0.1.1] (localhost [127.0.0.1]) by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTP id 3AEEB68A13B; Thu, 3 Jan 2019 02:44:09 +0200 (EET) X-Original-To: ffmpeg-devel@ffmpeg.org Delivered-To: ffmpeg-devel@ffmpeg.org Received: from mail-qt1-f195.google.com (mail-qt1-f195.google.com [209.85.160.195]) by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTPS id 95FBA689DC4 for ; Thu, 3 Jan 2019 02:44:02 +0200 (EET) Received: by mail-qt1-f195.google.com with SMTP id l11so35421994qtp.0 for ; Wed, 02 Jan 2019 16:44:11 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=from:to:subject:date:message-id:mime-version :content-transfer-encoding; bh=GwVO/T3rlHgnZ+nqkQleMRJPXqGQqJGp7y9m1uJDifw=; b=dz7Wu3yvONpqN4a5XSOqf1pMuAoWvltd+swjhKNs1fFn+bjucqcnx8m5tVt5e2uLDL tYIjBBbKqpB6jp6WJKIN4m8IEBVoPNrdO+pz5s7OigyUvQ463G1Z3N8MWOrNa83xRzG+ XZ1gSBDiwoFXDRG7ZhcFXVF6Gfwaa0ja3pcZB045uJbLTys0LJKnz59J7FuTO7J1dWy5 qKUWNdAIK5dfU5PJwxUeh/5Q2Pu3BL159713UbRwl9eMOkh2KDctpLwwnw+yzaPhP9jt lkm/Bqwjm3tYXmeV4jThpifcKFRSpkW52jsMmV9L4L3l+WgXQgCPb91rkT22RA87jaEc dtKQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:to:subject:date:message-id:mime-version :content-transfer-encoding; bh=GwVO/T3rlHgnZ+nqkQleMRJPXqGQqJGp7y9m1uJDifw=; b=RyHe/eMp9PWIwW6UhlvuFZIGDqm9/wd76RaFDEkgJMAuyxwGq9d6mKEvpXqzpoJtA0 c7hBj3hCRB+ySnIErQrfNzCrtcnbZTT5OfPpfXl20he8xsnwsLuQgOxToQRNUXKp0fAd uZbrRNRhTAR5EECLt8F533F7oX7tC0ggWMng1lF0bi7ebFs7Q5opP5nSfDvOwWljRXma 3uT8YbUy8txlxKI+DZnfssVeErO99iUusZmjiHxkCtM6YTOjh/H8igsoOW6cKoORL8eY REoUCYqzLDIjga6ih4+Rq+mwmhPNOZYeRWEwEy1GuksZx7fYwLAjmRCP+h3wNL9tuUkq RgCA== X-Gm-Message-State: AA+aEWacK+IM3Ar1oTWaiabycNAZwPclrVAQyTJQHS+nkRGn2TXUpzSm 4QD8uDP8ci+3NbBorz2NvuRCuF/z X-Google-Smtp-Source: AFSGD/X234WiTcFe+baB6PERUYo7MTsyPsh5xDzitQic3dOQYeRgfPGhVElJ+bxlCuenMjs9jwJPPw== X-Received: by 2002:ac8:2a81:: with SMTP id b1mr44904287qta.110.1546476250030; Wed, 02 Jan 2019 16:44:10 -0800 (PST) Received: from localhost.localdomain ([179.38.167.144]) by smtp.gmail.com with ESMTPSA id o34sm23090515qte.4.2019.01.02.16.44.08 for (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Wed, 02 Jan 2019 16:44:09 -0800 (PST) From: James Almer To: ffmpeg-devel@ffmpeg.org Date: Wed, 2 Jan 2019 21:43:54 -0300 Message-Id: <20190103004357.5604-1-jamrial@gmail.com> X-Mailer: git-send-email 2.20.1 MIME-Version: 1.0 Subject: [FFmpeg-devel] [PATCH 1/4] x86/af_afir: fix processing the last element X-BeenThere: ffmpeg-devel@ffmpeg.org X-Mailman-Version: 2.1.20 Precedence: list List-Id: FFmpeg development discussions and patches List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Reply-To: FFmpeg development discussions and patches Errors-To: ffmpeg-devel-bounces@ffmpeg.org Sender: "ffmpeg-devel" ff_fcmul_add_sse3() is now identical to the C version. Signed-off-by: James Almer --- libavfilter/x86/af_afir.asm | 7 +++++-- 1 file changed, 5 insertions(+), 2 deletions(-) diff --git a/libavfilter/x86/af_afir.asm b/libavfilter/x86/af_afir.asm index 849d85e70f..fcc1f426db 100644 --- a/libavfilter/x86/af_afir.asm +++ b/libavfilter/x86/af_afir.asm @@ -30,7 +30,6 @@ SECTION .text INIT_XMM sse3 cglobal fcmul_add, 4,4,6, sum, t, c, len shl lend, 3 - add lend, mmsize*2 add tq, lenq add cq, lenq add sumq, lenq @@ -57,4 +56,8 @@ ALIGN 16 movaps [sumq + lenq+mmsize], m3 add lenq, mmsize*2 jl .loop - REP_RET + movss xm0, [tq + lenq] + mulss xm0, [cq + lenq] + addss xm0, [sumq + lenq] + movss [sumq + lenq], xm0 + RET