Bridges: Processing a large number of large files

Hey guys,

Do you have suggestions on the best way to submit batch jobs on Bridges for a project with a large number of large files?

I have a project where I need to process around 100,000 csv.gz files. The average file is about 500 MB. I am currently processing them with a grep | sed | awk pipeline, which takes approximately 0.5 hours per file on one RM-shared core.
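To put the scale in numbers, here is a quick back-of-envelope calculation. It uses only the figures above and assumes the averages hold across all files, which I have not checked; sizes are in decimal units.

```shell
#!/usr/bin/env bash
# Back-of-envelope totals from the figures in this post.
n_files=100000      # number of csv.gz files
mb_per_file=500     # average compressed size, in MB
min_per_file=30     # about 0.5 hours per file on one RM-shared core

# Integer arithmetic is exact for these values.
total_tb=$(( n_files * mb_per_file / 1000 / 1000 ))
core_hours=$(( n_files * min_per_file / 60 ))

echo "compressed input: ${total_tb} TB"
echo "compute: ${core_hours} core-hours"
```

So this is roughly 50 TB of compressed input and 50,000 core-hours in total.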

My plan is to submit 100,000 RM-shared batch jobs, each processing one file. My concern is that my later jobs will get lower priority because I will have already submitted or completed so many jobs.
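For concreteness, here is a rough sketch of the one-job-per-file plan. It is a dry run that only prints the sbatch commands rather than submitting anything. The script name process_one.sh is a hypothetical wrapper around my grep | sed | awk pipeline, the file names are made up, and the 1-hour time limit is just my guess at reasonable headroom over the 0.5-hour average. It also assumes file names contain no spaces.

```shell
#!/usr/bin/env bash
# Dry run: print one sbatch command per input file instead of submitting it.
# process_one.sh is hypothetical; it would wrap the grep | sed | awk pipeline.
make_submit_cmd() {
    local infile="$1"
    # One core on the RM-shared partition; the 1-hour limit is an assumed value.
    printf 'sbatch -p RM-shared -N 1 --ntasks-per-node=1 -t 01:00:00 --wrap "./process_one.sh %s"\n' "$infile"
}

# Demo on three made-up file names; the real list would have about 100,000 entries.
for f in part-00001.csv.gz part-00002.csv.gz part-00003.csv.gz; do
    make_submit_cmd "$f"
done
```

Piping that output to bash would submit the jobs for real, which is exactly the step I am unsure about.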

I cannot find any such priority mechanism described in the PSC documentation, but I think it is common on supercomputers. Can anyone help me verify that? Any other suggestions are also welcome.