From patchwork Mon Jul 31 06:37:03 2017
Content-Type: text/plain; charset="utf-8"
MIME-Version: 1.0
Content-Transfer-Encoding: 7bit
X-Patchwork-Submitter: kaustubh.raste@imgtec.com
X-Patchwork-Id: 4527
From: kaustubh.raste@imgtec.com
To: ffmpeg-devel@ffmpeg.org
Cc: Kaustubh Raste
Date: Mon, 31 Jul 2017 12:07:03 +0530
Message-ID: <1501483023-30703-1-git-send-email-kaustubh.raste@imgtec.com>
X-Mailer: git-send-email 1.7.9.5
Subject: [FFmpeg-devel] [PATCH] libavcodec/mips: Improve avc idct8 msa function

From: Kaustubh Raste

Replace memset call with msa stores.

Signed-off-by: Kaustubh Raste
---
 libavcodec/mips/h264idct_msa.c | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/libavcodec/mips/h264idct_msa.c b/libavcodec/mips/h264idct_msa.c
index 861befe..1e1a5c8 100644
--- a/libavcodec/mips/h264idct_msa.c
+++ b/libavcodec/mips/h264idct_msa.c
@@ -120,11 +120,12 @@ static void avc_idct8_addblk_msa(uint8_t *dst, int16_t *src, int32_t dst_stride)
     v4i32 res0_r, res1_r, res2_r, res3_r, res4_r, res5_r, res6_r, res7_r;
     v4i32 res0_l, res1_l, res2_l, res3_l, res4_l, res5_l, res6_l, res7_l;
     v16i8 dst0, dst1, dst2, dst3, dst4, dst5, dst6, dst7;
-    v16i8 zeros = { 0 };
+    v8i16 zeros = { 0 };
 
     src[0] += 32;
 
     LD_SH8(src, 8, src0, src1, src2, src3, src4, src5, src6, src7);
+    ST_SH8(zeros, zeros, zeros, zeros, zeros, zeros, zeros, zeros, src, 8);
 
     vec0 = src0 + src4;
     vec1 = src0 - src4;
@@ -318,7 +319,6 @@ void ff_h264_idct8_addblk_msa(uint8_t *dst, int16_t *src,
                               int32_t dst_stride)
 {
     avc_idct8_addblk_msa(dst, src, dst_stride);
-    memset(src, 0, 64 * sizeof(dctcoef));
 }
 
 void ff_h264_idct4x4_addblk_dc_msa(uint8_t *dst, int16_t *src,
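
Note for reviewers: the change clears the 8x8 coefficient block immediately
after it has been loaded into registers, instead of having the wrapper call
memset() once the transform is done. The sketch below illustrates that
ordering in portable C only. It does not use MSA intrinsics. The memcpy()
calls merely stand in for LD_SH8/ST_SH8, and every name in it
(process_rows, idct8_then_memset, idct8_clear_after_load, check_equivalence)
is hypothetical and not part of FFmpeg.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

#define BLK 8 /* 8x8 block of 16-bit coefficients, as in the idct8 path */

/* Hypothetical stand-in for the transform: sums the already-loaded rows. */
static int32_t process_rows(int16_t rows[BLK][BLK])
{
    int32_t sum = 0;
    for (int r = 0; r < BLK; r++)
        for (int c = 0; c < BLK; c++)
            sum += rows[r][c];
    return sum;
}

/* Old ordering: load, compute, then clear the source with memset(). */
static int32_t idct8_then_memset(int16_t *src)
{
    int16_t rows[BLK][BLK];
    memcpy(rows, src, sizeof(rows));            /* stands in for LD_SH8 */
    int32_t out = process_rows(rows);
    memset(src, 0, BLK * BLK * sizeof(*src));   /* separate clearing pass */
    return out;
}

/* New ordering: load, store eight zero rows right away, then compute. */
static int32_t idct8_clear_after_load(int16_t *src)
{
    static const int16_t zeros[BLK] = { 0 };    /* one row of zeros */
    int16_t rows[BLK][BLK];
    memcpy(rows, src, sizeof(rows));            /* stands in for LD_SH8 */
    for (int r = 0; r < BLK; r++)               /* stands in for ST_SH8 */
        memcpy(src + r * BLK, zeros, sizeof(zeros));
    return process_rows(rows);                  /* works on the local copy */
}

/* Both orderings must give the same result and leave src zeroed. */
static void check_equivalence(void)
{
    int16_t a[BLK * BLK], b[BLK * BLK];
    for (int i = 0; i < BLK * BLK; i++)
        a[i] = b[i] = (int16_t)(i - 20);
    assert(idct8_then_memset(a) == idct8_clear_after_load(b));
    for (int i = 0; i < BLK * BLK; i++)
        assert(a[i] == 0 && b[i] == 0);
}
```

Clearing early is safe here because the computation only reads the local
copy after the load. The change of zeros from v16i8 to v8i16 in the patch
presumably matches the element type that ST_SH8 stores.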