Methods For Processing And Analyzing Protein Structure Collections For Data-Driven Structure-Property Relationship Modeling