Uploaded image for project: 'Maven'
  1. Maven
  2. MNG-7592

String deduplication in model building

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • 4.0.x-candidate
    • None
    • None

    Description

      I currently investigate improving memory consumption in m2eclipse (maven ide extension) and noticed that one problem is that maven model seem to not deduplicate strings, so for large projects (I used apache camel as an example), there are a lot of duplicate strings hanging around, e.g. I see 12.000 instances of "org.apache.maven.plugins" or around 10.000 of "org.apache.camel" (please note that probably not all related to maven!).

      If I look at the Graph of incoming references I see for example that these are from Model/Artifact groupId.

      I know that string deduplication in general is hard and even controversial, but maybe one could think about such thing at least for the "hotsposts", e,g, groupId, artifactId and version or even managementKeys seem good candidates to be considered for such thing as these are used all over the place.

      Attachments

        Activity

          People

            Unassigned Unassigned
            laeubi Christoph Läubrich
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated: