Question about a matching program using SSE

I am a college student studying image processing.

Recently I have been studying code optimization with ICC, but I have run into a big problem.

Here is the problem:

I am studying a matching algorithm. I have found the final matching point, but I cannot display the result.

I want to write a certain value to a specific location in an array, but whenever I do that, the execution time becomes very high. I cannot understand why.

Please tell me how I can fix this problem. The source is in the attached file. My processor is a Pentium 4 and my memory is 512 MB of DDR.

