From patchwork Fri Aug 4 11:54:11 2017 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: kaustubh.raste@imgtec.com X-Patchwork-Id: 4622 Delivered-To: ffmpegpatchwork@gmail.com Received: by 10.103.46.211 with SMTP id u202csp2166300vsu; Fri, 4 Aug 2017 04:54:25 -0700 (PDT) X-Received: by 10.223.170.140 with SMTP id h12mr1437889wrc.79.1501847665826; Fri, 04 Aug 2017 04:54:25 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1501847665; cv=none; d=google.com; s=arc-20160816; b=aNigH4VjFtCc9AWBkt2B90QfOxAvWkuNQsD3CCQ+UVkD2VdElHCEKjsNDBnGUotIp2 6wAbvPzjYpQrzDY1mH7x+7bUGgHboAEfLm6lYA/gH3DYRi7Gojju2XFoclmtjNd9RICs wJ7MKUfhalDUWCr9el27shL/G17qAEfQ/JykmFXBLB7qY2D2d6xrPnxfGvCPH+gyQRJR VTZ+mzzdwBKZIKp2y8XPP9+3YtPFla5J4YWeXzQqeUDxNzd/SStnsAdRLs9fhT0oN65C PEy40PEg7qUiZDyR19nGmxqjeQ3lWqVBLpBVVLeBKHL0LW16rsplDjJQrjWFqa/PXvHz e8OQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=sender:errors-to:content-transfer-encoding:reply-to:list-subscribe :list-help:list-post:list-archive:list-unsubscribe:list-id :precedence:subject:mime-version:content-language:accept-language :in-reply-to:references:message-id:date:thread-index:thread-topic:to :from:delivered-to:arc-authentication-results; bh=dZeW8CTjbHdlacmeQTABML6OpKzl/LULFbDTmNeLYU0=; b=GagyunRh/Dte6hlyMIvj2Z372de0D43ojf1CM/5d0fIbbNaKAfshRoBEfqsGzYHVvt /V416wn66M5Q/qTVQDxaY1zgRKF228ECUvOYdAh8DfucAhvvu3YxMx6LSARHq2hCPWrX kss6QzNWLp/c3qfh6ofUvWrcESzsyCbr32eHbfZ2RAJ2KZxndSBh34OHIR87DN5JAEaj s2ClEBoW0XsuzbtimTM1CdIeoJu4GpHUhh6hi980j5OUk2G8PNwcoH/+xjtCnx0z7AT6 mAexmnm2sMgG37jIsxdAQajdWMVWlR2CwuvONZLnAMEe8F/CtdxJHx/BE58Uvb4LPxKm 63cA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of ffmpeg-devel-bounces@ffmpeg.org designates 79.124.17.100 as permitted sender) smtp.mailfrom=ffmpeg-devel-bounces@ffmpeg.org Return-Path: Received: from ffbox0-bg.mplayerhq.hu (ffbox0-bg.ffmpeg.org. [79.124.17.100]) by mx.google.com with ESMTP id b185si3024151wmc.261.2017.08.04.04.54.25; Fri, 04 Aug 2017 04:54:25 -0700 (PDT) Received-SPF: pass (google.com: domain of ffmpeg-devel-bounces@ffmpeg.org designates 79.124.17.100 as permitted sender) client-ip=79.124.17.100; Authentication-Results: mx.google.com; spf=pass (google.com: domain of ffmpeg-devel-bounces@ffmpeg.org designates 79.124.17.100 as permitted sender) smtp.mailfrom=ffmpeg-devel-bounces@ffmpeg.org Received: from [127.0.1.1] (localhost [127.0.0.1]) by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTP id 2FB126898B1; Fri, 4 Aug 2017 14:54:22 +0300 (EEST) X-Original-To: ffmpeg-devel@ffmpeg.org Delivered-To: ffmpeg-devel@ffmpeg.org Received: from mailapp01.imgtec.com (mailapp01.imgtec.com [195.59.15.196]) by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTP id 20B4E689827 for ; Fri, 4 Aug 2017 14:54:16 +0300 (EEST) Received: from HHMAIL01.hh.imgtec.org (unknown [10.100.10.19]) by Forcepoint Email with ESMTPS id 7F3A8190E157A for ; Fri, 4 Aug 2017 12:54:11 +0100 (IST) Received: from PUMAIL01.pu.imgtec.org (192.168.91.250) by HHMAIL01.hh.imgtec.org (10.100.10.19) with Microsoft SMTP Server (TLS) id 14.3.294.0; Fri, 4 Aug 2017 12:54:14 +0100 Received: from PUMAIL01.pu.imgtec.org ([::1]) by PUMAIL01.pu.imgtec.org ([::1]) with mapi id 14.03.0266.001; Fri, 4 Aug 2017 17:24:12 +0530 From: Kaustubh Raste To: FFmpeg development discussions and patches Thread-Topic: [FFmpeg-devel] [PATCH] libavcodec/mips: Improve avc idct8 msa function Thread-Index: AQHTCeWcLjpFZfG/vUexAU66sUyKv6J0HNow Date: Fri, 4 Aug 2017 11:54:11 +0000 Message-ID: <7AC45BA9D7010549B1787997D11B8C30E0864761@PUMAIL01.pu.imgtec.org> References: <1501483023-30703-1-git-send-email-kaustubh.raste@imgtec.com> <70293ACCC3BA6A4E81FFCA024C7A86E1E058C582@PUMAIL01.pu.imgtec.org> In-Reply-To: <70293ACCC3BA6A4E81FFCA024C7A86E1E058C582@PUMAIL01.pu.imgtec.org> Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: x-originating-ip: [192.168.91.73] MIME-Version: 1.0 Subject: Re: [FFmpeg-devel] [PATCH] libavcodec/mips: Improve avc idct8 msa function X-BeenThere: ffmpeg-devel@ffmpeg.org X-Mailman-Version: 2.1.20 Precedence: list List-Id: FFmpeg development discussions and patches List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Reply-To: FFmpeg development discussions and patches Errors-To: ffmpeg-devel-bounces@ffmpeg.org Sender: "ffmpeg-devel" Ping. -----Original Message----- From: Manojkumar Bhosale Sent: Monday, July 31, 2017 3:43 PM To: FFmpeg development discussions and patches Cc: Kaustubh Raste Subject: RE: [FFmpeg-devel] [PATCH] libavcodec/mips: Improve avc idct8 msa function LGTM thx -----Original Message----- From: ffmpeg-devel [mailto:ffmpeg-devel-bounces@ffmpeg.org] On Behalf Of kaustubh.raste@imgtec.com Sent: Monday, July 31, 2017 12:07 PM To: ffmpeg-devel@ffmpeg.org Cc: Kaustubh Raste Subject: [FFmpeg-devel] [PATCH] libavcodec/mips: Improve avc idct8 msa function From: Kaustubh Raste Replace memset call with msa stores. Signed-off-by: Kaustubh Raste --- libavcodec/mips/h264idct_msa.c | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) _______________________________________________ ffmpeg-devel mailing list ffmpeg-devel@ffmpeg.org http://ffmpeg.org/mailman/listinfo/ffmpeg-devel diff --git a/libavcodec/mips/h264idct_msa.c b/libavcodec/mips/h264idct_msa.c index 861befe..1e1a5c8 100644 --- a/libavcodec/mips/h264idct_msa.c +++ b/libavcodec/mips/h264idct_msa.c @@ -120,11 +120,12 @@ static void avc_idct8_addblk_msa(uint8_t *dst, int16_t *src, int32_t dst_stride) v4i32 res0_r, res1_r, res2_r, res3_r, res4_r, res5_r, res6_r, res7_r; v4i32 res0_l, res1_l, res2_l, res3_l, res4_l, res5_l, res6_l, res7_l; v16i8 dst0, dst1, dst2, dst3, dst4, dst5, dst6, dst7; - v16i8 zeros = { 0 }; + v8i16 zeros = { 0 }; src[0] += 32; LD_SH8(src, 8, src0, src1, src2, src3, src4, src5, src6, src7); + ST_SH8(zeros, zeros, zeros, zeros, zeros, zeros, zeros, zeros, src, + 8); vec0 = src0 + src4; vec1 = src0 - src4; @@ -318,7 +319,6 @@ void ff_h264_idct8_addblk_msa(uint8_t *dst, int16_t *src, int32_t dst_stride) { avc_idct8_addblk_msa(dst, src, dst_stride); - memset(src, 0, 64 * sizeof(dctcoef)); } void ff_h264_idct4x4_addblk_dc_msa(uint8_t *dst, int16_t *src, -- 1.7.9.5