harmony-commits mailing list archives

From "Mikhail Fursov (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HARMONY-5599) [drlvm][jit][performance] new movapd instruction for register-to-register copy
Date Thu, 13 Mar 2008 15:12:24 GMT

    [ https://issues.apache.org/jira/browse/HARMONY-5599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12578331#action_12578331 ]

Mikhail Fursov commented on HARMONY-5599:
-----------------------------------------

The patch needs to be fixed.
Reason:
AFAIK MOVAPD is an SSE2 instruction, so we need to fall back to the old (SSE) way when SSE2 is not available.

Check the Ia32i586InstsExpansion code to see how the CPUID check is performed.
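
A minimal sketch of the kind of guard described above, assuming the feature flags come from CPUID leaf 1, where EDX bit 25 reports SSE and bit 26 reports SSE2. It does not reproduce the Ia32i586InstsExpansion code. The function names and the "pre-patch-copy" label are hypothetical, and the EDX value is passed in as a plain integer rather than read from hardware.

```python
# CPUID leaf 1 (EAX=1) returns feature flags in EDX.
SSE_BIT = 25   # EDX bit 25: SSE
SSE2_BIT = 26  # EDX bit 26: SSE2


def has_feature(edx: int, bit: int) -> bool:
    """Return True if the given feature bit is set in the EDX value."""
    return bool((edx >> bit) & 1)


def choose_xmm_copy(edx: int) -> str:
    """Pick the register-to-register copy to emit (hypothetical helper).

    MOVAPD is only legal when SSE2 is present; otherwise keep whatever
    the JIT emitted before the patch, labeled "pre-patch-copy" here.
    """
    if has_feature(edx, SSE2_BIT):
        return "movapd"
    return "pre-patch-copy"


# Example EDX values (made up for illustration, not from a real CPU).
edx_sse_only = 1 << SSE_BIT
edx_sse_and_sse2 = (1 << SSE_BIT) | (1 << SSE2_BIT)

print(choose_xmm_copy(edx_sse_only))
print(choose_xmm_copy(edx_sse_and_sse2))
```

The point of the guard is that the decision is made once from the CPU feature bits, so the emitted code never contains MOVAPD on a processor that would fault on it.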

> [drlvm][jit][performance] new movapd instruction for register-to-register copy
> ------------------------------------------------------------------------------
>
>                 Key: HARMONY-5599
>                 URL: https://issues.apache.org/jira/browse/HARMONY-5599
>             Project: Harmony
>          Issue Type: Improvement
>          Components: DRLVM
>            Reporter: Vladimir Strigun
>            Assignee: Mikhail Fursov
>         Attachments: HARMONY-5599.patch
>
>
> Using the movapd instruction to copy between xmm registers is more efficient than the
> partial movsd copy, so the attached patch replaces movsd with movapd for such
> operations.
> I've checked the patch on the SciMark benchmark [1] and got the following results (about 15%
> speedup for the composite score):
> orig build:
> SciMark 2.0a
> Composite Score: 236.8043350027899
> FFT (1024): 266.4183025101507
> SOR (100x100):   410.3833460433766
> Monte Carlo : 31.43640457526972
> Sparse matmult (N=1000, nz=5000): 208.14991492655557
> LU (100x100): 267.6337069585971
> with movapd:
> SciMark 2.0a
> Composite Score: 271.62584550328904
> FFT (1024): 296.11079189672955
> SOR (100x100):   458.00820213602486
> Monte Carlo : 31.406979573247035
> Sparse matmult (N=1000, nz=5000): 208.14991492655557
> LU (100x100): 364.453338983888
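
To make the "about 15%" figure easy to check, here is a small sketch that computes the relative change per kernel from the scores quoted above. The numbers are copied from the two runs in the issue description; the dictionary layout and the function name are only illustrative.

```python
# Scores copied from the issue description (SciMark 2.0a).
orig = {
    "Composite Score": 236.8043350027899,
    "FFT (1024)": 266.4183025101507,
    "SOR (100x100)": 410.3833460433766,
    "Monte Carlo": 31.43640457526972,
    "Sparse matmult (N=1000, nz=5000)": 208.14991492655557,
    "LU (100x100)": 267.6337069585971,
}
with_movapd = {
    "Composite Score": 271.62584550328904,
    "FFT (1024)": 296.11079189672955,
    "SOR (100x100)": 458.00820213602486,
    "Monte Carlo": 31.406979573247035,
    "Sparse matmult (N=1000, nz=5000)": 208.14991492655557,
    "LU (100x100)": 364.453338983888,
}


def speedup_percent(before: float, after: float) -> float:
    """Relative change of `after` over `before`, in percent."""
    return (after - before) / before * 100.0


speedups = {name: speedup_percent(orig[name], with_movapd[name]) for name in orig}

for name, pct in speedups.items():
    print(f"{name}: {pct:+.1f}%")
```

On these figures the composite gain is a little under 15%, most of it coming from LU, FFT and SOR, while Monte Carlo and Sparse matmult are essentially unchanged.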

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

