winzip icon

Sorting array and removing duplicates

Submitted on: 2/14/2015 6:05:00 PM
By: Folo (from psc cd)  
Level: Advanced
User Rating: By 9 Users
Compatibility: VB 4.0 (16-bit), VB 4.0 (32-bit), VB 5.0, VB 6.0
Views: 4027
     The following code is a demonstration of how to remove duplicates from a string array as quickly as possible. The program first uses the very fast TriQuickSort algorithm to sort the array after it has been dimensioned and populated. Then it uses the remdups sub to remove any possible duplicates from the array. remdups assumes that you don't want any vbNullString in your array, if you would, then just replace vbNullString in the remdups sub with any other character, like Chr$(1), or whatever that floats your boat. This code can be used to remove duplicates from arrays that contains hundreds of thousands of entries, even millions. And it's very fast.
winzip iconDownload code

Note: Due to the size or complexity of this submission, the author has submitted it as a .zip file to shorten your download time. Afterdownloading it, you will need a program like Winzip to decompress it.Virus note:All files are scanned once-a-day by Planet Source Code for viruses, but new viruses come out every day, so no prevention program can catch 100% of them. For your own safety, please:
  1. Re-scan downloaded files using your personal virus checker before using it.
  2. NEVER, EVER run compiled files (.exe's, .ocx's, .dll's etc.)--only run source code.
  3. Scan the source code with Minnow's Project Scanner

If you don't have a virus scanner, you can get one at many places on the net

Other 1 submission(s) by this author


Report Bad Submission
Use this form to tell us if this entry should be deleted (i.e contains no code, is a virus, etc.).
This submission should be removed because:

Your Vote

What do you think of this code (in the Advanced category)?
(The code with your highest vote will win this month's coding contest!)
Excellent  Good  Average  Below Average  Poor (See voting log ...)

Other User Comments

12/16/2015 2:09:40 AMNick Bullar

Excellent code, can check for duplicates in 34 byte long lines: it can check about 4 million in under 3 minutes which is great... obviously then I run out of memory.

I have some novel ideas for how to change for any duplicates in say ~1 billion, 34 char strings. VB6 might not be the program to do it in. I wondered if you could input any ideas.

I understand the checking time could be several hours or days, that's no problem.
(If this comment was disrespectful, please report it.)


Add Your Feedback
Your feedback will be posted below and an email sent to the author. Please remember that the author was kind enough to share this with you, so any criticisms must be stated politely, or they will be deleted. (For feedback not related to this particular code, please click here instead.)

To post feedback, first please login.