I am currently working on an application of logistic regression for lanslide susceptibility assessment. In order to extract predictors' values form raster layers, I generated equally spaced grid points in landslide area and landslide - free area. It is generally recognized that the point samples should be about the same size (number of 1 - landslide points = number of 0 - non-landslide points). If this is the case, then the density of points will be significantly higher inside the landslide area than outside the landslide area, because the landslide area is generally much smaller than the landslide free area. Some say this difference in density is a critical issue. How does it affect the quality and reliability of the computed regression model? Are there any sampling alternatives?