Information on SSE4 including some whitepapers is available at www.intel.com/software/avx while SSE3 and SSE4 are both documented in the software developer manuals http://www.intel.com/products/processor/manuals/- I'd recommend you look mainly at volumes 2a and 2b for details of the instructions. Can you speak more of the problem you're trying to solve?
Untill now, I am not use SSE3 and SSe4 in my application. I just want to have some knowledge about them. Whem I have further tests of them, maybe I will have some problems to share with you.
Where can I find documents about SSE3 and SSE4 for early study, or the website link?
Thanks.
I would suggest to read "IA-32 Intel Architecture Optimization Reference Manual" to start with. Followed by "Intel Intrinsic Functions Reference" & "Intel C++ Compiler Reference" documents. These documents does have preliminary examples to start writing SSE3 & SSE4 based code.
It's very hard to find and address your needs. The Intel Vol-I & II Instructions Manual would take long time for any beginners to conquer it. Would advice follow few things to start with -
I think abovefour should be sufficient to guide you in writing small SSE2 examples program and than latter move to Intel Vol-I & II Instructions Manual for better understanding OR try referring in between for any doubts related to definition of any instructions.
Perform google search too if needed apart from above 4 helps.