Abstract:To increase the spatial overlay analysis efficiency of massive land use vector data, we proposes a method of land use vector data spatial overlay analysis based on Spark. The method realizes index filtering and overlay calculating of land use vector data by resilient distributed datasets(RDD), which is a basic data structure of Spark. It makes a new attempt to overcome the bottleneck of land use vector data spatial overlay analysis. Comparing with the traditional overlay analysis based on Oracle data management, the approach increases the spatial overlay analysis efficiency significantly, which is more suitable for spatial overlay analysis with massive land use vector data, and can help us to manage and analyse the massive land use vector data.