
Posts

Showing posts with the label DataSet

Your help in Fashion : 7 layer CNN at your service (~92% accurate)

Hey Readers! Welcome to yet another post where I play with a self-designed neural network. This CNN tackles a variant of the classic MNIST known as the Fashion-MNIST dataset. Before we explore the approach for this dataset, let's first check out what this dataset really is. Fashion-MNIST is a dataset of Zalando's article images, consisting of a training set of 60,000 examples and a test set of 10,000 examples. Each example is a 28x28 grayscale image, associated with a label from 10 classes. Zalando intends Fashion-MNIST to serve as a direct drop-in replacement for the original MNIST dataset for benchmarking machine learning algorithms. It shares the same image size and the same structure of training and testing splits. The original MNIST dataset contains a lot of handwritten digits. Members of the AI/ML/Data Science community love this dataset and use it as a benchmark to validate their algorithms. In fact, MNIST is often the first dataset researchers try. ...
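
As a rough illustration of what "same image size and structure" means in practice: the Fashion-MNIST files are distributed in the same IDX binary layout as the original MNIST. The sketch below is not taken from the post. It parses that layout with only the standard library and runs on synthetic bytes instead of the real download. The function names `parse_idx_images`, `parse_idx_labels` and `make_synthetic_images` are hypothetical helpers introduced for this example.

```python
import struct

IMAGE_MAGIC = 2051  # 0x00000803: unsigned bytes, 3 dimensions
LABEL_MAGIC = 2049  # 0x00000801: unsigned bytes, 1 dimension


def parse_idx_images(buf: bytes):
    """Parse an uncompressed IDX3 buffer into a list of images (lists of pixel rows)."""
    magic, count, rows, cols = struct.unpack(">IIII", buf[:16])  # big-endian header
    if magic != IMAGE_MAGIC:
        raise ValueError(f"unexpected image magic number: {magic}")
    if len(buf) != 16 + count * rows * cols:
        raise ValueError("buffer length does not match the header")
    images = []
    offset = 16
    for _ in range(count):
        image = [
            list(buf[offset + r * cols: offset + (r + 1) * cols])
            for r in range(rows)
        ]
        images.append(image)
        offset += rows * cols
    return images


def parse_idx_labels(buf: bytes):
    """Parse an uncompressed IDX1 buffer into a list of integer labels."""
    magic, count = struct.unpack(">II", buf[:8])
    if magic != LABEL_MAGIC:
        raise ValueError(f"unexpected label magic number: {magic}")
    return list(buf[8:8 + count])


def make_synthetic_images(count, rows=28, cols=28):
    """Build a fake IDX3 buffer (synthetic pixels, not real Fashion-MNIST data)."""
    header = struct.pack(">IIII", IMAGE_MAGIC, count, rows, cols)
    pixels = bytes(i % 256 for i in range(count * rows * cols))
    return header + pixels


demo_images = parse_idx_images(make_synthetic_images(3))
demo_labels = parse_idx_labels(struct.pack(">II", LABEL_MAGIC, 3) + bytes([0, 9, 4]))

print(len(demo_images), len(demo_images[0]), len(demo_images[0][0]))
print(demo_labels)
```

To use this on the real files, you would first decompress the downloaded `.gz` archives (for example with the standard `gzip` module) and pass the resulting bytes to these functions.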

AI Rivalries : Flipkart enroute in creating Alexa's Nemesis?

Hey Readers! A recent development caught my eye in the field of E-commerce rivalries! Have a look at the following article clipping: Journalism Credits : Business Today Group Walmart-backed Flipkart has just issued a challenge to Amazon's Alexa and Google Assistant. The home-grown e-commerce giant today announced that it has acquired Bengaluru-based artificial intelligence (AI) startup Liv.ai, which has developed a platform that converts speech-to-text in nine regional languages apart from English. With this move, the e-tailer hopes to soon offer an end-to-end conversational shopping experience for its users. "Given the complexities in typing on vernacular keyboards, voice will become a preferred interface for new shoppers. One does understand that building a voice interface is complex, and is especially challenging in Indian context given multiple languages and accents," Flipkart CEO Kalyan Krishnamurthy said in a statement. "Ultimately, we want to give ou...

Kaggle Dataset Analysis : Is your Avocado organic or not?

Hey readers! Today, allow me to present yet another dataset analysis, this time on a rather gluttonous topic: avocado price analysis. This dataset represents historical data on avocado prices and sales volume in multiple US markets. Our prime objectives will be to visualize the dataset, pre-process it and ultimately test multiple sklearn classifiers to check out which one gives us the best confidence and accuracy in telling whether an avocado is organic! Note : I'd like to extend the kernel contribution to Shivam Negi. All this code belongs to him. Data Visualization This script should produce the following jointplot. A similar joint plot can be drawn to infer the linear or exponential relationship between extra large bags and small ones. Pre Processing The following script has been used for pre-processing the input data. Model Definition and Comparisons We will be looking mostly at three different models, namely ra...
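
The idea of testing several sklearn classifiers against each other can be sketched roughly as follows. This is not the kernel's code: the data is synthetic (generated with `make_classification` as a stand-in for the avocado features), and the three models are arbitrary picks for illustration, not necessarily the ones compared in the post.

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.tree import DecisionTreeClassifier

# Synthetic binary-classification data standing in for "organic vs. conventional".
X, y = make_classification(
    n_samples=500, n_features=8, n_informative=4, random_state=0
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0, stratify=y
)

# Arbitrary example models (an assumption, not the post's actual selection).
models = {
    "decision_tree": DecisionTreeClassifier(random_state=0),
    "k_neighbors": KNeighborsClassifier(n_neighbors=5),
    "logistic_regression": LogisticRegression(max_iter=1000),
}

# Fit each model on the training split and score it on the held-out split.
scores = {}
for name, model in models.items():
    model.fit(X_train, y_train)
    scores[name] = accuracy_score(y_test, model.predict(X_test))

# Report models from best to worst held-out accuracy.
for name, score in sorted(scores.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{name}: {score:.3f}")
```

Scoring on a held-out split, as opposed to the training data, is what makes the comparison between models meaningful.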

Artificial Intelligence Pens Shakespeare Sonnet!

Hey guys! I recently came across an excellent poetry algorithm named Deep-Speare. A four-person team from the School of Information and the Graduate School of Education designed a computer algorithm which was able to successfully fool people trying to distinguish between human- and bot-written verses nearly 50 per cent of the time. However, experts could still easily identify machine-generated poetry, and AI may have a long way to go before it can outdo Shakespeare, researchers said. Computer scientists at University of Melbourne in Australia and University of Toronto in Canada designed an algorithm that writes poetry following the rules of rhyme and metre. In some ways, the computer's verses were better than Shakespeare's. The rhymes and metre in the machine-generated poetry were more precise than in human-written poems. Test Run Feedback The following excerpt is courtesy Economic Times: ...

Predicting Cost of Tender with 99.24% Accuracy : Miracle!

Data Science is reaching new levels and so are the models. But reaching a whopping 99.24% accuracy using simple feature engineering and a simple Decision Tree Classifier? That's new! Hello everyone, today I am going to present my model, which can predict the value range of a tender in Seattle Trade Permits with a whopping accuracy of 99.24% (with some obvious caveats which I will discuss at the end). My Kernel : Yet Another Value Prediction The Prediction Kernel BASIC EDA This time, I am going to use the plotly library in Python. This is literally the best option for interactive plots and if you actually visit the kernel, you will understand why. First of all, we will focus on checking out the Top Grossing Contractors in the Seattle area who have earned the most out of the tender acquisitions. This will lead to this interactive graph: Similarly, one could plot out another graph for Amount earned per project. But another thing which caug...

Datasets by Microsoft Research now available in the cloud : Microsoft announces open Datasets!

Hey Readers, today I bring exciting news for all you aspiring data scientists and machine learners! Something new happened on the Microsoft Research Blog: The Microsoft Research Outreach team has worked extensively with the external research community to enable adoption of cloud-based research infrastructure over the past few years. Through this process, we experienced the ubiquity of Jim Gray’s fourth paradigm of discovery based on data-intensive science – that is, almost all research projects have a data component to them. This data deluge also demonstrated a clear need for curated and meaningful datasets in the research community, not only in computer science but also in interdisciplinary and domain sciences. Today we are excited to launch Microsoft Research Open Data – a new data repository in the cloud dedicated to facilitating collaboration across the global research community. Microsoft Research Open Data, in a single, convenient, cloud-hosted ...
