Closed Thread
Results 1 to 3 of 3

Thread: Need a data analysis program for a project

  1. #1
    yellowbus is offline Newbie
    Join Date
    Feb 2008
    Posts
    1
    Rep Power
    0

    Need a data analysis program for a project

    Hi folks. I’m in the middle of a science fair project, and I have 3000 random numbers of data. I need to analyze this data for trends, common numbers, common combinations, favored numbers, and etc to prove that this is not a random set of numbers

    For example, my data would look like
    “1538973549105789587129857120985978473875291857918 …..” with 3000 numbers
    My goal is to see favored numbers (7 might appear 20% of the time, 0 might appear 4% of the time), combinations (777 might appear more often than it should in a random set of numbers, or 666 may never appear at all), trends (I’ve seen trends such as 345-245-645-745-045-145), and etc. to prove that this data is not random. The problem is there’s so many combinations of numbers to check for, and if I check only certain combinations such as 777 or 13579, it will be biased.

    Is there any some sort of data analysis program that can do this for me or help me in any way, or is there any way to program one to do this, or must I use the find function in Microsoft word and look for all trends and combination manually?

    I’ll get into the specifics if you need more info, thanks in advance

  2. CODECALL Circuit advertisement
    Join Date
    Always
    Posts
    Many

     
  3. #2
    Join Date
    Jul 2006
    Posts
    16,491
    Blog Entries
    75
    Rep Power
    143
    You can do this in most programming languages fairly easily. The issue will be determining what patterns you are looking for. For example, determine the frequency of all 1 digit strings, all 2 digit strings, etc.

    You will also need to determine whether anomalies are statistically significant. It is important to realize that 7 appearing 20% of the time, while unexpected, does not necessarily show that the data is non-random. For example, if you find that it is statistically significant with a 95% confidence, all you are saying is that there was a less than 1 in 20 chance of it occurring, not something all that significant, really.
    Programming is a branch of mathematics.
    My CodeCall Blog | My Personal Blog

  4. #3
    Izak is offline Newbie
    Join Date
    Feb 2008
    Posts
    5
    Rep Power
    0
    I think this might turn out to be more complicated than it appears, unless you are aware of all the constraints on what the pattern is the string could mean any number of things.

    Is the purpose to disprove the existence of a random number generator or to decipher a cleverly constructed code?

    Izak

Closed Thread

Thread Information

Users Browsing this Thread

There are currently 1 users browsing this thread. (0 members and 1 guests)

Similar Threads

  1. sound data analysis program?? Any suggestions
    By kidMallok in forum C and C++
    Replies: 3
    Last Post: 10-30-2010, 10:39 PM
  2. Replies: 1
    Last Post: 07-21-2010, 12:26 PM
  3. How do I query mysql data via a link ($_GET?)? Glossary, term project help!?!
    By banker in forum Database & Database Programming
    Replies: 2
    Last Post: 04-20-2010, 01:51 PM
  4. Image analysis program: Java or C++ ?
    By espinchi in forum General Programming
    Replies: 3
    Last Post: 10-25-2007, 09:00 AM

Tags for this Thread

Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts