As someone who struggles with this as well (and I finished my PhD in 2018 :O), I think I have some helpful things to say.
Think about some theorem you've proved all by yourself, with all the technicalities. It may be from your master's or some hard exercise. Now, think about the summarized version of the proof: something you could describe to another mathematician during a phone call. What details of the proof would you include? Which ones would you discard? I believe this phone-call version of a proof is what you're aiming for when you try to get the overall strategy of a proof.
As an example: the main result of my PhD thesis consists of, basically, complexifying a real-analytic foliated hypersurface (which is initially written in an inconvenient form, with too many terms in a sum), then blowing up the singularity and computing the holonomy of the leaves. I then showed that these holonomy groups are all linearizable, which allows me to use previous results to write my hypersurface in an adequate normal form, discarding the higher-order terms of its original expression. For context, this generalises to infinitely many cases what was previously done for a few singularities classified by Arnold.
This is something another mathematician from my field (singularities, foliations...) would be able to get a general picture of. All the keywords are there.
The difference is, you now have to extract this from a proof in a paper without necessarily breaking it down into its smaller parts (since it's not something you did previously!). This is something that comes with practice and your own gut feeling ("how deep do I have to go to understand this?"): read, read, read. Papers usually offer a brief explanation of the general technique or context of their main results: trying to spot that in the proofs may be a good way to do so.
Lastly, deciding which things to really break down or not is something that also comes from practice. This is the part I have the most difficulty with. I usually try the overview reading first and, if too many things seem interesting or useful, I try to perform a finer reading of the text.
Also, something that usually worries me when I'm reading stuff is: "shouldn't I have studied and mastered all these topics during my PhD? Why can't I follow this proof like I can in a textbook?". Many topics won't be covered during your PhD, so get ready to face advanced new stuff out of the blue ("I wanted to study differential equations, what the **** is a scheme??", I heard someone say... :P )