It is not clear from your question what exactly is your data and what are you trying to get rid of. If you already have a list of SNPs/Indels for your "very large genomic sequence" in the variant call format (*.vcf), consult the official documentation of the tool (https://vcftools.github.io/man_latest.html) to learn about the features and usage.