The issues you are having with hyperthreading is due to how the OS manages the threads. Normally for proper use of hyperthreading, the OS will make sure that all physical cores are in use before a logical core is used, and the most demanding threads should never share a physical core unless there is no better alternative.
If the OS does not fully support this, then you end up with cases where 1 physical core is stuck having to handle 2 heavy threads while there are other available physical cores being underutilized.
A good example of this (while not intel), is the crappy AMD FX where they went with the core module crap. The core modules work better than hyperthreading as most of the processing hardware is duplicated, thus fewer components are shared. (hyperthreading shares all of the processing hardware of the core, but allows it to load up 2 threads thus all of the cores processing components can be better utilized)
http://www.extremetech.com/computing/138394-amds-fx-8350-analyzed-does-piledriver-deliver-where-bulldozer-fell-short/2As you can see, if 2 threads fall on the same core module (thus some processing components are begin shared), then you get an overall drop in performance. If 1 core in each core module is disabled, then the overall performance of a 4 threaded workload increases. If all 8 cores are turned on, and the OS is left to decide which cores are used, then you get a non optimal use of the 4 threads, and some end up sharing a core module, and overall performance suffers.
The performance impact of 2 threads falling on the same core in a hyperthreaded environment is far higher since no processing components are duplicated, so the worst case scenario is an application not being able to fully use all of the physical and logical cores, and then mistakenly assigning 2 important threads to the same physical core.
Lucky for intel is they did not try to hide the fact that the logical cores were not true cores, thus it is easy to identify which is a physical core, and which is a logical core (provided they release updated drivers to properly inform the OS). On the AMD side, they did do the same, instead they count all of them as a full core, thus there are many cases where the FX chips are perform far slower than the Phenom II (older gen).
The Core i7 3770k is 4 physical cores, and 4 logical cores (which share all of the resources of the physical core, thus they only offer a benefit if there are parts of the core not being fully used)
On a side note, if you use adobe premiere pro, then 32GB will be the minimum that you need to fully use each core since it allocates memory for each core in order to make sure that you you do not run memory limits. (64GB is the bare minimum for 4K video)
Adobe premiere pro is designed around supporting a large number of cores, and is regularly used in dual and quad CPU environments.