Thursday, October 1, 2015

Data Mining by Clustering

Clustering is one of the most used techniques in the Data Mining field. Its objective, in ARGOS project, is to help analyzing existing data by subdividing these data into a small number of clusters or (groups) where the events inside any specific cluster are very similar to each other and very dissimilar to the events of the other clusters.
As a result of the clustering process, several important information will be obtained among which two major ones can be quoted:


• The reduction of a large amount of events into a smaller number of events which contain almost all of the useful and relevant information

• the highlighting of regular and recurring events on the one hand which represent the frequent and “normal” events and relatively rare events on the other hand which represent infrequent and eventually “abnormal” events
Each cluster is described by the most important attributes that participated to the grouping of its events. This leads to the discovery of the profiles of the "normal" events as well as the "abnormal" ones. 

The clustering method we use in Argos project is “Relational Analysis theory” [2]. This method has very powerful advantages with respect to the k-means method (the most used method for clustering), among which we can quote:

• no need to fix arbitrary the number of clusters to be found in data. 

• no need to fix the clusters’ centroids (like in the k-means method)

This method has been applied in several fields (insurance, banking, video recordings, marketing, etc.). An example of its use in video recordings can be found in [1].

By Dr. Hamid Benhadda and Mikael Griffoulieres  -  Thales Services

References

[1] H. Benhadda, J.L. Patino, E. Corvee, F. Bremond, and M. Thonnat. “Data mining on large video recordings”. Colloque V.S.S.T.2007 : Veille Stratégique Scientifique & Technologique, 21-25 Octobre, Marrakech, 2007.

[2] Mustapha Lebbah, Younes Bennani and Hamid Benhadda. “Relational Analysis for Consensus Clustering from Multiple Partitions”. Machine Learning and Applications. ICMLA 2008: Seventh International Conference. pp 218- 223. San Diego, California, December 11-13, 2008.

18 comments:

  1. Big data is set to take the healthcare industry to the next-level of profit making. However it is imperative that healthcare institutions take a more holistic, patient-centric approach that focuses on superior health-care results and treatment expenditures. See more benefits of data mining in healthcare

    ReplyDelete
  2. It was really a nice article and i was really impressed by reading this
    Big data hadoop online training Bangalore

    ReplyDelete
  3. Thanks a lot very much for the high quality and results-oriented help. I won’t think twice to endorse your blog post to anybody who wants and needs support about this area. data science training in Hyderabad

    ReplyDelete
  4. I would also motivate just about every person to save this web page for any favorite assistance to assist posted the appearance.
    Digital Marketing Training Institutes in Hyderabad

    ReplyDelete

  5. Very informative post ! There is a lot of information here that can help any business get started with a successful social networking campaign !

    Best Data Science Courses in Hyderabad

    ReplyDelete
  6. I’m happy I located this blog! From time to time, students want to cognitive the keys of productive literary essays composing. Your first-class knowledge about this good post can become a proper basis for such people. nice one
    data scientist certification

    ReplyDelete
  7. Wonderful blog post. This is absolute magic from you! I have never seen a more wonderful post than this one. You've really made my day today with this. I hope you keep this up!
    data scientist training and placement in hyderabad

    ReplyDelete

  8. I guess I am the only one who came here to share my very own experience. Guess what!? I am using my laptop for almost the past 2 years, but I had no idea of solving some basic issues. I do not know how to Download Cracked Pro Softwares But thankfully, I recently visited a website named procrackhere.com
    Awesome Miner rack

    ReplyDelete
  9. It would help if you thought that the data scientists are the highest-paid employees in a company.data science course in kochi

    ReplyDelete
  10. The first and foremost thing when learning data science is the discovery of data insight. In this aspect, the raw data is analyzed to gather information from raw data.
    data science course in gorakhpur</a

    ReplyDelete

  11. Develop technical skills and become an expert in analyzing large sets of data by enrolling for the Best Data Science course in Bangalore. Gain in-depth knowledge in Data Visualization, Statistics, and Predictive Analytics along with the two famous programming languages and Python. Learn to derive valuable insights from data using skills of Data Mining, Statistics, Machine Learning, Network Analysis, etc, and apply the skills you will learn in your final Capstone project to get recognized by potential employers.

    Data Science in Bangalore


    ReplyDelete
  12. Data Science helps in buyer retention by figuring out the triggers and churns of a business.

    Data Analytics Course in Calicut

    ReplyDelete
  13. I think this is a really good article. You make this information interesting and engaging. You give readers a lot to think about and I appreciate that kind of writing.
    data science training

    ReplyDelete
  14. In the heart of Gurgaon's technological landscape, APTRON's Data Science Institute in Gurgaon stands as a hub of excellence. Its comprehensive curriculum, expert faculty, practical approach, top-notch infrastructure, placement assistance, and networking opportunities make it a standout choice for individuals aspiring to excel in the field of data science. By choosing APTRON, you're not just enrolling in an institute – you're embarking on a transformative journey toward becoming a proficient data scientist ready to conquer the data-driven world.

    ReplyDelete
  15. Are you ready to embark on a rewarding journey into the world of data science? Look no further than APTRON, the foremost Data Science Training Institute in Gurgaon . In today's data-driven world, mastering the art of data science is a surefire way to boost your career prospects, and APTRON is here to guide you every step of the way.

    ReplyDelete