Solving the thermal problems of HD2 or other snapdragon powered devices

Search This thread

AFP2003D

Member
Dec 13, 2008
16
0
Question about jtag and overheating

hello my friends. I appreciate the effort that @facdemol do to overcome this problem.
thank you all for testing and sharing thoughts.

I have a problem.
I have hd2 Tmobile with 1GB ROM and i have update it with a wrong update virsion which contains number 51 . This bricked my hd2 and now no bootloader, no screen . No charging. Nothing in the screen. It is dead.
the phone has never been opened at all.

my question is:

will jtag process lead my phone to have overheating issue? ( Note: I dont have any heating or other problems with my phone before i bricked it )

or heating problem have another cause not jtag??

please make it clear for me so I can send it to to the nearest country for fixing. (we dont have jtag in Oman )


thank you all in advance...
 

DTNT

Senior Member
Oct 17, 2009
60
2
Tashkent
hello my friends. I appreciate the effort that @facdemol do to overcome this problem.
thank you all for testing and sharing thoughts.

I have a problem.
I have hd2 Tmobile with 1GB ROM and i have update it with a wrong update virsion which contains number 51 . This bricked my hd2 and now no bootloader, no screen . No charging. Nothing in the screen. It is dead.
the phone has never been opened at all.

my question is:

will jtag process lead my phone to have overheating issue? ( Note: I dont have any heating or other problems with my phone before i bricked it )

or heating problem have another cause not jtag??

please make it clear for me so I can send it to to the nearest country for fixing. (we dont have jtag in Oman )


thank you all in advance...

Your issue is an topic of another discussion. please try to search in neighbour themes!
 

AFP2003D

Member
Dec 13, 2008
16
0
I didnt demand a lot
It is just a question. Will i have heating problems after jtag or not?
If Yes then thank u
If No then thank u
Thats it

Sent from my GT-I9000 using Tapatalk
 

motoi_bogdan

Senior Member
Sep 8, 2007
319
337
@ booze_pistol - really sorry to hear that. That's the exact way one of my hd2's died, although i did exactly as with the first one and the method was tested and found to be successful. It seems that although the problem looks the same, not all hd2's have the same bga pins broken in the same way for we to believe that an universal fix will be available.

I dissected that hd2 motherboard that died. It had the same problem as yours. On power up it begins to heat up. No led's, nothing on the display. After the cpu removal i found out that 2 of those small balls were in contact to each other, somewhere around the center of the chip. The board was dead because of only 2 small contacts touching each other. Should i know that back then i would have tried more reheating sessions in various positions hoping that i could somehow break that short circuit.

I really don't know what to say at this time. Although i've found out in my experiments that in the end, applying heat is the only way to solve this problem, my further progress lead to users killing some phones while success still being limited. As far as i know, I fixed 2 hd2's, you fixed one (using an alternate heating method) and ubejd fixed another (using a pretty expensive infrared SMD rework station). Except for that.. I also killed a hd2, you killed another and there are still couple more examples only in this thread.

I wish i could have did all of the experimenting on my side and give you guys a clear and 100% sure thing solution. Alas i cannot, i don't have neither the tools or spare hd2 broken motherboards.

@AFP2003D JTAG doesn't involve heating up the cpu. There are several check points on the board, and from there the JTAG pins are connected in order to flash the phone. It has nothing to do with the problem described here, nor it will produce it. It is true that the JTAG protocol itself needs the CPU (jtag is a function of the CPU) but it cannot produce thermal problems by its own.
 
Last edited:

HillsRider

Senior Member
Feb 14, 2011
142
16
South Dakota
I have managed to revive my HTC Incredible that was suffering from the same problems. It was at the point of shutting down while not in use if the ambient temp was anywhere above 26 C. I removed the motherboard and heated the processor to 200 C for 10 minutes and reassembled. I am only 2 days into testing the phone but have only had one reset so far and it immediately recovered after the reboot.

Thanks to facdemol for pointing me in the right direction!
 

motoi_bogdan

Senior Member
Sep 8, 2007
319
337
well, i'm glad it helped. As you can see here, we have pretty mixed up results, so i guess you were lucky.
One more thing, what did you use to heat up the board/cpu?
 

HillsRider

Senior Member
Feb 14, 2011
142
16
South Dakota
well, i'm glad it helped. As you can see here, we have pretty mixed up results, so i guess you were lucky.
One more thing, what did you use to heat up the board/cpu?

Well, I'm sure it isn't something that would be recommended to anyone, but I used a soldering iron. It is a decent iron with temp control so I removed the tip and made a copper plate the same size as the processor to replace it. I started with the soldering iron cold, positioned it on the processor and started heating till it hit 200, which took 4-5 minutes. I left it for another 5 minutes once it was up to temp then let it cool back to room temp.

I would say I got really lucky. :rolleyes:
 

booze_pistol

Member
Jun 14, 2011
14
5
FRANKLIn
@ booze_pistol - really sorry to hear that. That's the exact way one of my hd2's died, although i did exactly as with the first one and the method was tested and found to be successful. It seems that although the problem looks the same, not all hd2's have the same bga pins broken in the same way for we to believe that an universal fix will be available.

I dissected that hd2 motherboard that died. It had the same problem as yours. On power up it begins to heat up. No led's, nothing on the display. After the cpu removal i found out that 2 of those small balls were in contact to each other, somewhere around the center of the chip. The board was dead because of only 2 small contacts touching each other. Should i know that back then i would have tried more reheating sessions in various positions hoping that i could somehow break that short circuit.

I really don't know what to say at this time. Although i've found out in my experiments that in the end, applying heat is the only way to solve this problem, my further progress lead to users killing some phones while success still being limited. As far as i know, I fixed 2 hd2's, you fixed one (using an alternate heating method) and ubejd fixed another (using a pretty expensive infrared SMD rework station). Except for that.. I also killed a hd2, you killed another and there are still couple more examples only in this thread.

I wish i could have did all of the experimenting on my side and give you guys a clear and 100% sure thing solution. Alas i cannot, i don't have neither the tools or spare hd2 broken motherboards.

I am certain that pins on the CPU have melded together, maybe only 1 pair, maybe more. It is definitely a dead short. The almost immediate heating up as soon as power is applied is a direct indication of this. I have considered and may try to repeat the process, using different angles and positions of the board to see if I can break them loose. There is nothing to lose at this point huh? :D

I knew the risks involved in doing this. I don't regret trying it at all. It should be noted even Infrared SMD rework stations are not 100% successful. Ask anyone who repairs xbox 360's. They are without a doubt the preferred method, but nothing is certain. This problem with HD2's is different with every phone, even though many phones share the same general problem, each and every phone is going to be different. A universal fix is likely impossible, especially using household appliances.

I still appreciate all your work, and without this thread and this board I would still have two non working HD2's instead of one.
 

Capt.PP

Senior Member
Oct 19, 2010
441
86
OC
I want to follow your guild to make a heatsink. But I can't view the pictures on how to make it and where to place it. Can you please, post the pictures in different picture hosting site because imagewoof.com server is down?
 
Last edited:

DTNT

Senior Member
Oct 17, 2009
60
2
Tashkent
I want to follow your guild to make a heatsink. But I can't view the pictures on how to make it and where to place it. Can you please, post the pictures in different picture hosting site because imagewoof.com server is down?

I downloaded those pictures to the MS Word file. Please see them. File is attached in a RAR format.
 

Attachments

  • pix.rar
    2.7 MB · Views: 110
  • Like
Reactions: Capt.PP

andrew-o

Member
Nov 9, 2010
8
0
Hi all!
I had the same problem with my hd2. First time it was hanging and rebooting. I made a good cooper "radiator" :) on the motherboard as in the first post. Everything was ok, but in a few weeks it hangs and do not starts, just whit screen.
I resoldered motherboard as described in service with soldering machine.
And now hardware buttons on a phone are not working, when i connect charger and press reset, the phone starts, but screen is black, just little backlight i can see.
Also i have installed MyMobiler soft on my phone, and when it is connected to pc and syncronized, i can see screen of phone on PC and manage it. So motherboard is alive, i can do anything through PC, touchscreen of phone is working, but screen is black.
Can you give me an advice?
Somewhere i read that it can be such issue with controller of power to screen, that is situated in the opposite side of processor. But i can't find this info now.
I think, i have to resolder motherboard again to fix this.
Maybe you had such situations.
Thanks.
 

Capt.PP

Senior Member
Oct 19, 2010
441
86
OC
I downloaded those pictures to the MS Word file. Please see them. File is attached in a RAR format.

Thanks man, my HD2 is getting worst now. I only have about 10 mins of call time and 10 mins of gaming before it froze...I need to buy the right screwdriver before starting the operation.
 

DTNT

Senior Member
Oct 17, 2009
60
2
Tashkent
Thanks man, my HD2 is getting worst now. I only have about 10 mins of call time and 10 mins of gaming before it froze...I need to buy the right screwdriver before starting the operation.

You are welcome Capt.PP! The only way that can fix your problem can be gas oven method. I think only gas ovens (not electrical) can help in this issue. Or a hotplate described earlier. Screwdrivers you can find in any mini or supermarket in your neighbourhood.
ps. My HD2 do the same headaches as yours, to prolongue its life try to use ice, put it in plastic bag and place HD2 to that bag (or any tailormade ice holding cradle:) So in a days I will try to cook HD2 in gas oven cam and post pix and results respectively.
 

consolation

Senior Member
Apr 18, 2010
726
230
Redmi Note 9t
While gas ovens can get hotter than some electrics, good electrical ones won't have a problem. Plus, they will give you a much more even heat and won't have the moisture issues of gas powered ovens. Rememnet gas produces co2 plus h2o as waste, not a problem when hot, but you can get condensation when cooling. I think electricity is the prepared source.
 

andrew-o

Member
Nov 9, 2010
8
0
Well, so nobody had such problem with power to lcd? Or anything else could be damaged while resoldering?
 

DTNT

Senior Member
Oct 17, 2009
60
2
Tashkent
Well, so nobody had such problem with power to lcd? Or anything else could be damaged while resoldering?

Try to find in the beginning of the topic where Facdemol explained several after cooking effects and syndroms of HD2. I think if PC sees your device it can be fixed. Wait for experts answers also!
 

DTNT

Senior Member
Oct 17, 2009
60
2
Tashkent
While gas ovens can get hotter than some electrics, good electrical ones won't have a problem. Plus, they will give you a much more even heat and won't have the moisture issues of gas powered ovens. Rememnet gas produces co2 plus h2o as waste, not a problem when hot, but you can get condensation when cooling. I think electricity is the prepared source.

You can cool down device out of gas oven! And as I mentioned before electric ovens can have problems with electric waves induction (thru hot tungsten and etc). And of course it can cause/harm electrical scheme of device.
 

the_scotsman

Assistant Forum Admin
Staff member
I bought a TMOUS HD2 off eBay advertised as locking up, hoping to fix it and sell it on. After lots of playing around, I've found that it has this heating issue.

The phone doesn't get HOT, or even WARM...but it simply won't get past the splash screen, UNLESS I sit it on top of an ice pack....in this case, it works great.

I'm experienced in micro miniature soldering, and have *some* equipment at work.

I'll have a detailed read of your posts again and open this one up and have a look, see if I can do anything. I have a thermal gun that can direct heat onto the chip...but as you said, this is risky...

Have you made any progress recently on any process of solving this issue?

PS. Amazing work so far, your write ups are fantastic! ;)
 

booze_pistol

Member
Jun 14, 2011
14
5
FRANKLIn
Update on more success

So I have been busy working on this HD2 situation. I finally broke down and got my SMD rework station. I went with an Aoyue 968. Absolutely love it, it has made my life so much better. Also purchased 3 K-type digital probes. The best investment ever. they were $11.66 ea shipping and all on eBay. Just search "New K Type Digital Thermometer Temperature Sensor"

I have also been able to obtain 3 more HD2's. I was talking about it to some people at work, and they knew people, or friends who had them with this problem. I am happy to report that I have been able to fix all 3 of the phones. Well I broke one of the tiny cables to the camera flash unit on one, but all 3 phones are working like new again after the repair.

With the new equipment that I got, and with some truly accurate measurements , I have made some discoveries that hopefully will allow people to have a higher success rate with this issue. I was able to remove the CPU from the failed oven HD2 bake. The cpu release from the board at around 238C. In multiple tests of that board, and on that cpu the solder begins to melt at between 213 -218C. That is a very important discovery for all TMOUS users. This is the "sweet spot" if you will for repairing these phones.

No matter what heat source you are using, you need to achieve 215C at the CPU /board level. My 968 may be set at 320C to 360C, but that is not relevant. It is the temp that the board itself is that matters in this process. Using the Ktype probes after you properly calibrate them will allow you to get an accurate reading at the board. Going slightly higher than the 215C might work, but it is not necessary so far in my experiments.

With all 3 phones I repaired yesterday I used the hotplate as a bottom heater for the board. I set it to 185C , and let the board preheat for 5 minutes. Using the Aoyue 968 with 18mm square netted nozzle, I start it at 320C. VERY IMPORTANT. These are temps of the equipment, not the temp of the board. Using a Ktype probe on top of the board at the cpu, and one Ktype probe on the bottom of the board at the cpu, attached with aluminum tape, I am able to see the board level temp very accurately. When it reaches 215C I keep it there for 1 minute. Turn off top and bottom heat, let it cool completely. VERY IMPORTANT. keep the board perfectly level, do not bump it or move it at all.

This worked on all 3 phones. I am certain at this point it will have a high success rate ongoing. Grant you the equipment used not everyone has. My main point I am trying to contribute here is no matter your heat source do not exceed the 215C mark. buy yourself one or more of the Ktype probes I mentioned. Calibrate them and use them to make sure of the temps. It is not necessary to keep that 215C temp for more than a minute, two minutes max. You are trying to revive cracked / cold solder joints, anymore time than that you risk allowing the solder joints to merge and ruin the process.

To those who have nothing more than an "ELECTRIC" oven to use to fix this issue. Here is my two cents. Get a Ktype probe! It is a must. Preheat the oven to 225C. Attach the probe to the cpu with metal tape, put the board on a ceramic cup in the center of the oven, make sure it is level, allow the lead for the probe to come outside the oven to the meter. The probes I mentioned have heat resistant cables. Watch the temp on the meter, when the "CPU TEMP" , I repeat the CPU TEMP reaches 215C time it 1 minute, then turn the oven off and let it cool completely. This method allows for no preheating of the board, but placing it in the preheated oven it will rise to the temp gradually anyway. No more than 3 degrees per second is perfectly fine. Little chance of a board this small flexing and causing damage. I do not recommend this at all, but if your gonna try it in an "electric oven" my opinion is this is your best chance at success. The temp probe is the best friend you will ever have in this procedure.

Well I am off to look for more phones to repair. :D
 

Top Liked Posts

  • There are no posts matching your filters.
  • 61
    UPDATE -

    As of June 14. 2011 it appears the only viable way to solve this problem is to reheat the cpu with specialized equipment. I am currently testing if this can be done in a standard gas/electric oven (NOT MICROWAVE !!). If successful this method will solve the problem and the following guide (cpu cooling system) may help prevent it from happening again. Only building this cooling system is not enough and will not solve your problems.


    Hello dear members of XDA

    First of all please excuse my english, I will try to explain myself as well as I can. It will be a long post, it could be boring, it could be scary or whatever you like but bear with me on this one.

    So.. got myself a HTC (T-Mobile) HD2. A bit late in the game, but hell no, still a good phone. I've dreamed of having one but couldn't afford. Anyway finally i got one. A second hand broken one, damn it. :p
    As i found out the problem is pretty common: damn thing restart itself - thermally related - the old CPU overheat problem. By searching the net I found out that it's pretty common with some HTC models. HD2 has it, Desire has it, Nexus One has it, hell even some xperia models have it.. about half of the devices powered by anything from the Snapdragon series could have it.
    The problem could be easely described as : phone hanging, restarts, the dreaded 7-8 short vibrate sequence - phone locked etc.
    Mine was worst then i've seen on the forums or with other people. It locked itself for just about every reason i could get. Taking pictures, browsing the menu, using gps, the browser, 3g or wifi, watching a movie ... all concluded with restarts or lock-ups after some couple of minutes. I've found out that keeping the phone at 4-5 degrees celsius would solve my problems in most cases, but anything above 10-15 degrees would make the thing go crazy.

    Well, I'm pasionate about electronics, development in this area, trying to solve problems and things like that. Also experienced in heat and semiconductor related problems. I also had one macbook air that suffered from core shutdown because of overheating (also a well known problem for MBA rev 1.0) and managed to design an alternate cooling system that solved the problem. So i gave it a shot, i know there are many users that have similar problems and altrough i don't suggest them explicitly to make this hacks to their phones.. this is one way to solve the problem if you buy your unit second hand or don't have some form of warranty.

    So here we go.

    Big fat warning!!! Don't attempt these things with your phones unless you are familiar with the concepts or the tools involved in the process. Also, there is a real risk to permanently damage your phone. Not just real.. but big if you get something wrong.

    First step is to run some simple tests to determine the cause of the problem or the range it extends to.
    So, I used a multimeter with a K type thermal probe to measure the temperature of various components of the phone during intensive use.



    this is the back of the mainboard of my HD2. If you notice, HTC placed a blue-ish thermal pad over one metallic shield covering the back components. I don't know what's the purpose as the back casing in that area is made of plastic - no heat dissipation, or a bad one. Anyway that's a good place to place my probe. Some tape holded the probe in position. Because we don't have perfect mechanical contact between the probe tip and the casing or chips i expect +1 or +2 degrees celsius to be added to each measurement i will later describe.



    i now placed the battery over the back of the phone and secured it with some other tape and some toothpicks :D



    we're at 19.3 degrees. That's were we'll going to start from.

    there's a usefull little app that allows users to overclock or stress test their phones cpu. Found it here on XDA, i'll use it for some heat making purposes.



    as you can see.. we're already at 25.8 degrees, after 5 minutes of testing.. not to mention the actual heat making primary suspect - qualcomm chipset is on the other side. At 29.5 degrees at this point.. the phone locked itself. I reapeted the experiment 2 more times - got exactly the same result.. at least the readings were consistent.

    Ok, i then removed the motherboard to take some readings from the actual CPU.



    same procedure.. next readings. - at around 33.4 - 34.2 degrees (varies) on the CPU itself the phone will either restart or lock itself up. So you see how serious my problem is. Summer will come so I won't be able to use my phone... :p
    Measures have to be taken.

    Let's make a small introduction about heat related to semiconductors.

    Well, simply put a conductor (semiconductors act the same way) generates some amount of heat when an electric current is passed along it. This is because of the fact that small electrons moving along the conductor (in a simple way that's the definition of any electric current) will ocasionaly collide with the atoms of the material their passing through. In the collision the electron loses some amount of energy. That energy is heat. Also, heat itself can be described at an atomic level as the intensification of natural ocuring brownian movement of atoms. If they move a lot, if they are more agitated they create more heat. If they are more agitated, they are more likely to be hit by passing electrons. So a hot conductor is more likely to get even hotter because of that. There is a point were the heat generated makes the conductor's atoms prone to more hits from passing electrons in kind of like a geometric progression. That's called thermal runaway. It will tend to destroy electronics by overheating, melting or burning themselves up.
    Back to our phones now. The CPU produces heat. Because of the same effect described above. The heat in this case will either melt or break the small "balls" that comprise the BGA matrix on what these cips are mounted on. The small balls will either melt (extreme cases) or dilatate with increasing temperature. However it seems most of the new processors used by HTC are mounted in some epoxy resin that has both dilatation point and melting point higher then the flux and welding compound used to solder those cips. So the actual cip will tend to stay fixed in a particular position, unable to expand or contract with temperature variations, but the balls used in the BGA matrix underneath it will contract or expand with these variations. This could lead to a case when at least one of that balls (some couple hundreds in total) become "loose" or out of position, thus breaking the electrical contact it should have made. Therefore our problems. At fist large amounts of heat must be applied in order to actually break the bond between the cpu and board, but after that, once broken the tiny links are very sensible to temperature variations and they will expand or contract freely.
    Most users notice that at it's core, the problem seemed related to overheating (in the begining) but after time it's effects are degenerative.. phones seem to restart with no apparent reason. It's still overheating, but things are starting to get more and more worse as the chip and it's connections become more sensitive to heat variations. Thus, even small variations now produce these problems - my CPU restarts at 34 degrees .. that sucks.

    So, my only option was to try to reheat the cpu in the attempt to partially melt the broken "balls" in the bga matrix and hopefully.. i repeat HOPEFULLY they remake contact with the mainboard. A re-ball of this chip is not possible, as the resin placed around it by HTC doesn't melt at the normal temperature i could remove the chip itself, so heating it at even higher temperatures would risk killing the cpu long before the resin melts. Strange move by HTC to make things like this.

    Anyway.. here goes nothing..


    I've placed the usual aluminium foil designed to protect surrounding components by the heat generated by the rework station and the hot air used to heat up the CPU.



    I preheated the CPU for about 10 minutes, from both sides of the board, then switched to heating it at 360 degrees. I applied even pressure above it after it was heated in order to tighten the space between it and the board, just a little bit. THIS IS VERY RISKY. Normally not recommended because of the risk damaging the BGA. In this case the resin would prevent me from moving the chip to much so it's less risky. Not safe.. but less risky. :p

    I've let the board to cool on it's own for half an hour and repeated the temperature monitoring tests.
    Now i had an increase of maximum temperature before a restart from 34 degrees on the cpu to about 42. It's not much but it's a start. However above these temperature.. the phone will still lock or restart.
    I went for another round of reheating with the hot air station. After this, i've got slightly better results. Some 2-3 degrees more. My lucky break was when i suspected thermal runaway for the CPU. So i tried to make some sort of a heat sink for that chip using some mica foils for to220 can transistors, some thermal grease and a bunch of aluminum and copper foils. My theory was that heat dissipation will eventually accelerate faster above a specific level, a point from witch thermal runaway occur. In my case in the initial tests, even after the phone locked itself and i manually restarted (battery out - in) the temperature continued to increase even faster altrough the phone wasn't doing anything intensive.
    The role of my "heat sink" would be to dissipate more heat rapidly and in some manner to press the cpu against the board.
    After I placed the mica foils directly above the cpu with thermal grease above and beyond i mounted back the metal shield over that area. On it, i placed some more silicon paste and some thick copper foil (used in some broken laptops i have over here). It looks ugly but.. worth a shot:



    after that i begin making the rest of the heat sink using aluminum foil. I folded about 12 layers, between each of them having placed... more thermal grease and at the 6-7 layer another round of mica crystal foil.

    Here's the aluminum foil



    I then pressed the foils very hard between two flat surfaces in order to remove the excess thermal grease.
    I "anodized" the first layer (the one in contact with the cpu shielding) with some ferric chloride. Before that, the board looked like this:



    After the logic board was mounted back, i remade all the connections and after some preliminary tests, mounted the phone back together. It now looks like this



    I only have to re-attach the serial no. and imei, plastic sticker.

    Of course i then run tests. I heated up the phone with a hair dryer to simulate a hot summer day. About 40 degrees, just to be sure. I then run cpu stress tests and a full divx movie (impossible in the past). On preliminary testing, i had indications that i avoided the thermal runaway the cpu now running stable at 24 degrees (19.3 in the room - ambient temperature). No more, heating up by itself to about 40 degrees then restart.
    On the final testing, with the phone put together, i heated it with the hair dryer and achieved 40 degrees. I started it and run stress tests. No more lockups or restarts, not even a single one. However with the phone put together i can't measure inside temperature on it's components. As i feel it, it get's warmer, it heats up to some degree, but now it's spread all over it's surface. For some particular reason it doesn't restart anymore.

    I then tried, cpu stress test, wlan connection, pc connection and browsing the net all at the same time. NO RESTART :D I watched a full 1.30 hour movie at max playable quality, the phone was really hot (43-44 degree at it's surface) but still no problems.
    It appears that for the moment i saved the phone. However, future behavior is still to be determined.

    I'll get back with more testing, in the following days and eventually i hope to devise a general method for building heat sinks for phones (yeaah, ridiculous....) using combinations of metal and thermal conductive cristals. The ideea is to find out if reheating the chip by hot air station can be avoided (this involves the most risk). But the start is promising. By the time warranties will expire and phones like the new droids or winmo 7's start to break from thermal problems, maybe i'll have some sort of a more user friendly solution.

    EDIT JUNE 04.2011
    since i have a dead hd2 motherboard here, i tried to remove the cpu to expose the BGA soldering. Just for fun, no chance of BGA reball, as there aren't any tools available for this particular chip. The resin prevents a proper removal, at about 450 degrees celsius it was still kind of hard, so i had to forcefully remove the chip and break some of the BGA. The chip is very thin, kind of like a micro sd card. It heats up pretty quick and fast, the solder points underneath it got melted in about 2-3 min at 370 degrees celsius.
    Here's how it looks.
    This is the motherboard without the chip. The BGA matrix is broken, some balls were simply ripped out when i forcefully removed the chip.



    This is the actual chip compared with a mini sd and and standard sd card.



    ...and this is the underside of the chip. belive it or not, the chip is actually alive and it's pins are ok. It cannot be used because it cannot be properly soldered to a board. Guess i'm gonna punch a hole through it and use it at my key chain, along with a laptop cpu already there ;)



    In the following days i will experiment with the solder points&materials in order to try to produce a more safer method to reheat future boards with thermal problems. It seems this board died because of overheating and a short circuit made over the center of the array by 3 solder balls that got in contact once they were melted.
    4
    the mica crystal pads should be available at any electronic components store. If you can't find any, you could try to substitute them with any other similar purpose material. Use only thermal pads used in electronics for semiconductor (transistors mostly) thermal dissipation. However from what i know or can test, the mica ones are superior to other designs or materials.
    Also, good quality thermal paste is a MUST. Cheap one tend to dry out or loose effectiveness over time.

    @ profahmad - yes, the back of the lcd unit is metallic. Normally it was not intended to provide heat dissipation, neither is in direct contact with the heat making components, but it takes some of the heat and spreads it over it's surface. What i did is to forcefully use this piece of metal along with the materials i used for the "heat sync" in order to facilitate better thermal dissipation. The HD2 is build on the "edge" as you can see, even if the display unit is removed or improperly mounted, the small effect in cooling the board it once had is enough now to provoke some of the thermal issues.

    @januszgorlewski reheating is very risky without solid previous experience. Simply reheating the cpu didn't solve the problem for me, it only ameliorated it a bit. The new heat sync did the trick so i suspect you can skip reheating with not much of a loss in effectiveness. However i should have experienced with more devices in order to know for sure the effects of each stage of my experiment.

    @sqeeza yes, a petition could be filed out. However, there are 20-30 topics in this area about hd2 freezing or restarting but most people don't know there is a thermal problem related with these events. If we advertise the problem and it's cause to these people they could run some simple test to determine if their phones are also suffering from this problem.
    4
    first of all heating up the qualcom chip is recomanded as a last resort option. however if you reheat it, pressing the chip to the board is VERRY dangerous, as it could permanently damage the BGA connection.

    Here's some sort of guide on doing this. You will need a screwdriver, some 4-5 mica foil pads (you can get them from any electronic component store (get them for either TO3 or TO220 casing and cut them to the size of the cpu inside hd2) some good thermal grease (arctic silver or something for pc cpu's) an aluminum sheet for you to cut a piece of it.
    * i don't recommend silicon thermal pads, use only mica crystal pads
    * you can substitute the aluminum plate with aluminum/copper foil - the first is the one used for food wrapping)
    * i don't recommend using anything beside a smd rework station (either hot air or infrared) to heat up the board. Although a heat gun can develop high temperatures, the air debit is to high (dangerous, you can blow up other components) and you will lack precise temperature control needed for this job.


    1. Disassemble the phone following HTC's official videos. Completely remove the motherboard from the phone's casing.
    2. Once you have the motherboard de-attached remove all metallic shields on both sides. Normally these prevent EM interferences from the outside to get in and mess with electric signals over the PCB. We can use them as part of the "cooling" system later.
    3. OPTIONAL - efficiency yet to be determined/great risk involved - use either a special oven (not microwave !! it WILL kill the phone!) or a smd rework station to pre-heat the mainboard. Temperature must be set at around 95-110 degrees. Board must be heated from both sides, or at least one at a time, beginning with the one opposing the cpu side. Let it preheat at least 10 minutes.
    3a. after preheating, use an aluminum foil to cover the rest of the components, anything other then the cpu itself then get to the actual heating, switching first to 250 degrees and directing the air stream on the cpu itself (using a larger nozzle for the tip of the heating gun). After 2-3 minutes of 250 degrees, swich to 340-360 degrees and heat the chip for another 5minutes. Move the heating gun around the surface of the chip and try to heat it evenly. If you have the guts and you are crazy enough use a knife with a larger blade and put the tip of the blade in the hot air stream in front of the cpu. Let it heat for a while, and also, continue heating the cpu. When the blade tip is hot enough press the chip with it , starting from the center and following each side. Apply even force on each press and try to have the blade as parallel with the chip possible. Don't press too hard, if you haven't kill the chip yet, that will kill it.
    3b. let the board to cool down on it's own and during cooling try not to move it or do anything to it.

    4. place a little amount of thermal grease on top of the cpu then place 1-2 mica foil pads (depending on thickness) over the cpu. Gently press the mica foil with one finger over the cpu. Now place more thermal grease over that mica foil and try to place the metalic shield over that area. If successfully done, the metallic shield should be in contact with the mica foil and the grease. Place back all shields on the main board.
    5. On the phone's casing, measure the back of the display and try to cut an aluminum sheet of exactly the same size. If the sheet you can find is too thick - polish it and place it in a solution of either caustic soda or ferric chloride. This will get it thinner, but you have to supervise the process as if you leave it for long, the sheet could get completely dissolved. Check the sheet on short intervals (1min) to see the progress. Always use gloves and eye protection as both substances are dangerous (never mix them, use only one of them, the one you can get or already have). Once done, you will have a thin aluminum sheet that's flexible and about 1mm thick.
    6. notice there are some ribbons connecting the display to the motherboard or other exposed metallic contacts. Before placing the aluminum sheet over the display's back, place some insulating tape over those metallic contacts to prevent any shortcircuit forming between them and the aluminum sheet. Next place the aluminum sheet over the display's back. Be careful not to damage any connector or ribbon in the process.
    7. place more thermal grease on the cpu's metallic shield and check to see if the motherboard gets in good thermal contact with the aluminum sheet you just placed over the display's back. If there is still some space between them, use another mica foil and place thermal grease on both sides of it.
    8. reassemble the phone, and make some tests to see if you get some improvements.

    One more thing, this little project of our is in a "more to be seen/tested" state. As of now... only one device was fixed by this method - mine, it could have been simple luck. I don't know yet. :p more then a week later (strange weather also, + 20 degrees outside then last time i wrote the original post) the phone still works ok. Now running 1.3Ghz overclocked with NAND Android


    @ januszgorlewski i remember the first time the phone was vibrating 7 times and i didn't know about this problem, i though it was an WM6.5 Energy Rom feature :) .
    3
    as i found out until now the steps from a good working hd2 to this problems are something like this:

    1. phone working ok. mainboard (lower part of the device) heats in some conditions - demanding programs etc battery can reach about 40-45 degrees max. without problems. The phone will restart or freeze (cpu halt) in any of these situations :
    - battery temp exceeds 45 degrees and stays over this value for at least 5-10 minutes in order to trigger the thermistor used to measure the temperature in this area over i2c. at this time, it will prevent further charging and restart or lock the phone. This is normal behavior.
    -CPU exceeds 60-65 degrees (exact value still to be determined. i'm trying to get acces to some similar chipset datasheets). This produces CPU halt. Depending on what you're doing, the halt will either reset the phone or simply lock it up. Restarting by soft reset or by itself will probably return the user back in the home screen with the phone still working. This is also normal behavior, related to qualcomm chip.

    2. phone starts to malfunction. This condition starts by either large variation in temperature - mainboard al low temperature gets fast to full load or simply sustained full load. All HTC HD2's revisions have the same type of soldering in the cpu area. Visually speaking (no conclusive data yet) first revision used a bit more epoxy resin to secure the cpu in place. In the context of overheating and solder balls dilatation, that's not quite a good thing. Some sort of thermal spike must occur in order to break the contact between cpu and motherboard. Warning, if your phone will lock up and doesn't restart by itself, it's imperative that you disconnect the battery because as I measured, even with the phone locked, the CPU still overheats even more, thermal runaway occurs and temperature climbs to dangerous levels. I never left the phone do this for a long time, therefore I don't know how much it will still overheat, but it does and it will. In the initial stage of the problem, only extended heavy load use can trigger the problem. A common case is keeping the phone on in the car and using it for gps navigation in a hot summer. If the phone will restart before either 45 degrees at battery or 60-65 degrees at cpu level (however the last one is harder to measure) then you certainly have problems and they are just at the start.

    3. problems get worse. At this stage it is possible to notice the 7 short vibrates at boot time if the phone is warm or kept in a warm environment. You don't have to push it very hard, it only needs to be warm. The vibration pattern is an error code made by the actual qualcomm chipset, not sent by either bootloader, spl or operating system. When in happens the cpu will lock itself up, however file transfer (including nand memory acces, storage card acces and basic operations) or other chipset functions will still work for some time. It appears only cpu processing is being halted. So if this occurs when you boot the phone, it will lock up, but if this occurs when you are flashing a rom, you might continue to see the progress bar still filling. The vibration pattern signals a physical damage to the qualcomm chipset has ocurred. There's no way around it, when it occur it will never just .. heal up by itself.
    You will notice that the temperatures needed to induce a restart/lockup will decrease with time (both battery & cpu).

    4. Problem at it's worse. CPU can lock itself even at 35-40 degrees (measured at it's level). Ambient temperature of only 10-12-15 degrees is enought to have the phone experience problems. The cpu start to suddenly produce either lock-outs or hard faults or simply work intermittently. The OS may give errors relating ARM CORE failure or fatal errors regarding execution of certain "lines" (related to code lines in the os core programming). At this stage, the phone doesn't need to feel warm in your hands to produce these problems. This could trick some people not to still relate this to thermal problems and look for the solution or problem cause elsewhere. It's still related... but at it's worse.

    5. Total CPU collapse. If the phone locks and remains locked in whatever screen or program it was running, like i've said before, it will still overheat. If a stage 4 phone is left overheating, chances are that more balls connecting the chipset to the motherboard will fail. If any one needed to correctly initialise the chip or to power it on, fails - then it's end game for that phone. It will simply stop working and never turn on. Some other variants are that the phone will only start if placed in a freezer or start but never complete a boot sequence (either os or bootloader .. or both could be unable to start)
    3
    yep.. more than 2 weeks have passed and after i completed all possible tests the phone still works ok.
    About 22-25 roms flashed (wp7, wm6.5, android, ubuntu) phone was used either normally or heated with a hair dryer. At about 30 degrees ambient room temperature, i run some 720p testing and manage to run sample videos until battery died out, then rerun the videos while charging (charging induces more heat also).
    In all those 2 weeks i had only 2 restarts, both in wp7 (can't remember what rom version did that) and both occurring when i was setting up the phone after the phone update. Phone was cold however. I didn't manage to produce more restarts either when the phone heated up or i tried running intensive apps on it. Guess it was software related.
    So.. i guess it's over with this problem.