Cultural heritage is under tremendous pressure in the rapidly growing and transforming cities of the global south. Historic cities and towns are often faced with the dilemma of having to preserve old monuments while responding to the pressure of adapting itself to a modern lifestyle, which often results in the loss of cultural heritage. Indian cities such as Delhi possess a rich legacy of tangible heritage, traditions, and arts, which are reflected in their present urban form. The creation of temporal 3D models of such cities not only provides a platform with which one can experience the past, but also helps to understand, examine, and improve its present deteriorating state. However, gaining access to historical data to support the development of city-scale 3D models is a challenge. While data gaps can be bridged by combining multiple data sources, this process also presents considerable technical challenges. This paper provides a framework to generate LoD-2 (level-of-detail) 3D models of the present (the 2020s) and the past (the 1970s) of a heritage mosque surrounded by a dense and complex urban settlement in Shahjahanabad (Old Delhi) by combining multiple VHR (very high resolution) satellite images. The images used are those of Pleiades and Worldview-1 and -3 (for the present) and HEXAGON KH-9 declassified spy images (for the past). The chronological steps are used to extract the DSMs and DTMs that provide a base for the 3D models. The models are rendered, and the past and present are visualized using graphics and videos. The results reveal an average increase of 80% in the heights of the built structures around the main monument (mosque), leading to a loss in the visibility of this central mosque.