Methods and systems of utilizing machine learning to provide trust scores in an online automobile marketplace
In one example aspect, a computerized method includes the step of providing an item listing. The item listing is listed in an e-commerce marketplace. The computerized method includes the step of identifying the item listing into a set of listing parameters. A listing parameter includes one or more listing images and one or more terms that are descriptive of the item listing. The computerized method includes using the listing parameters to do the following steps. The computerized method determines a trust score for the item listing. The trust score is based on a number of images of an item in the item listing. The computerized method a quality score of the number of images; a description score of a description of the item. The computerized method determines a pricing score. The pricing score is based on a percentage variation from a geographically relevant item valuation and an item research service, and a condition of the item. The computerized method determines a seller score. The seller score is based on the verified status of the seller, a seller rating of the seller, and a percentage of positive feedback. The computerized method determines a health score. The health score is based on a seller declaration, a service logs, a verification that the item is insured, and a verification that the item properly registered.
This application claims priority to U.S. patent application Ser. No. 15/292,111 filed on Oct. 12, 2016. U.S. patent application Ser. No. 15/292,111 claims priority to U.S. Patent Application Ser. No. 62/239,975, filed on Oct. 12, 2015. U.S. patent application Ser. No. 15/292,111 claims priority to U.S. Patent Application Ser. No. 62/407,497, filed on Oct. 12, 2016. U.S. patent application Ser. No. 15/292,111 is hereby incorporated by reference in its entirety.
BACKGROUND 1. FieldThis description relates to machine learning for optimizing e-commerce computing systems and methods and more particularly to a system, method and object of manufacture of machine learning to provide health scores in an online marketplace.
2. Related ArtIt is noted that users may have used automobiles (or other vehicle type). The users may want to sell said automobiles to other buying parties such as used-automobile dealerships etc. The buyers may want to verify the status of the used automobile. In an online marketplace, inspecting the automobile may not be an option as the automobile may be too far to travel to perform the inspection. Also, inspection/verification methods should be consistent. Accordingly, standardized methods of automatically generating consistent and reliable ratings of used automobiles in an online marketplace are desired.
BRIEF SUMMARY OF THE INVENTIONIn one example aspect, computerized method includes the step of providing an item listing. The item listing is listed in an e-commerce marketplace. The computerized method includes the step of identifying the item listing into a set of listing parameters. A listing parameter includes one or more listing images and one or more terms that are descriptive of the item listing. The computerized method includes using the listing parameters to do the following steps. The computerized method determines a trust score for the item listing. The trust score is based on a number of images of an item in the item listing. The computerized method a quality score of the number of images; a description score of a description of the item. The computerized method determines a pricing score. The pricing score is based on a percentage variation from a geographically relevant item valuation and an item research service, and a condition of the item. The computerized method determines a seller score. The seller score is based on the verified status of the seller, a seller rating of the seller, and a percentage of positive feedback. The computerized method determines a health score. The health score is based on a seller declaration, a service logs, a verification that the item is insured, and a verification that the item properly registered.
The Figures described above are a representative set, and are not an exhaustive set with respect to embodying the invention.
DESCRIPTIONDisclosed are a system, method, and article of providing health scores in an online marketplace. It is noted that other types of marketplaces can be used in other example embodiments. The following description is presented to enable a person of ordinary skill in the art to make and use the various embodiments. Descriptions of specific devices, techniques, and applications are provided only as examples. Various modifications to the examples described herein can be readily apparent to those of ordinary skill in the art, and the general principles defined herein may be applied to other examples and applications without departing from the spirit and scope of the various embodiments.
Reference throughout this specification to ‘one embodiment, ‘an embodiment,’ one example,’ or similar language means that a particular feature, structure, or characteristic described in connection with the embodiment is included in at least one embodiment of the present invention. Thus, appearances of the phrases ‘in one embodiment, ‘in an embodiment,’ and similar language throughout this specification may, but do not necessarily, all refer to the same embodiment.
Furthermore, the described features, structures, or characteristics of the invention may be combined in any suitable manner in one or more embodiments. In the following description, numerous specific details are provided, such as examples of programming, software modules, user selections, network transactions, database queries, database structures, hardware modules, hardware circuits, hardware chips, etc., to provide a thorough understanding of embodiments of the invention. One skilled in the relevant art can recognize, however, that the invention may be practiced without one or more of the specific details, or with other methods, components, materials, and so forth. In other instances, well-known structures, materials, or operations are not shown or described in detail to avoid obscuring aspects of the invention.
The schematic flow chart, diagrams included herein are generally set forth as logical flow chart diagrams. As such, the depicted order and labeled steps are indicative of one embodiment of the presented method. Other steps and methods may be conceived that are equivalent in function, logic, or effect to one or more steps, or portions thereof, of the illustrated method. Additionally, the format and symbols employed are provided to explain the logical steps of the method and are understood not to limit the scope of the method. Although various arrow types and line types may be employed in the flow chart diagrams, and they are understood not to limit the scope of the corresponding method. Indeed, some arrows or other connectors may be used to indicate only the logical flow of the method. For instance, an arrow may indicate a waiting or monitoring period of unspecified duration between enumerated steps of the depicted method. Additionally, the order in which a particular method occurs may or may not strictly adhere to the order of the corresponding steps shown.
Definitions
Example definitions for some embodiments are now provided.
Application programming interface (API) can specify how software components of various systems interact with each other.
Cloud computing can involve deploying groups of remote servers and/or software networks that allow centralized data storage and online access to computer services or resources. These groups of remote serves and/or software networks can be a collection of remote computing services.
E-commerce can be trading in products or services using computer networks, such as the Internet. Electronic commerce draws on technologies such as mobile core commerce, electronic funds transfer, supply chain management, Internet marketing, online transaction processing, electronic data interchange (EDI), inventory management systems, and automated data collection systems.
Machine learning is a type of artificial intelligence (AI) that provides computers with the ability to learn without being explicitly programmed. Machine learning focuses on the development of computer programs that can teach themselves to grow and change when exposed to new data. Example machine learning techniques that can be used herein include, inter alio: decision tree learning, association rule learning, artificial neural networks, inductive logic programming, support vector machines, clustering, Bayesian networks, reinforcement learning, representation learning, similarity and metric learning, and/or sparse dictionary learning. Random forests (RF) (e.g. random decision forests) are an ensemble learning method for classification, regression and other tasks, that operate by constructing a multitude of decision trees at training time and outputting the class that is the mode of the classes (e.g. classification) or mean prediction (e.g. regression) of the individual trees. RFs can correct for decision trees' habit of overfitting to their training set. Deep learning is a family of machine learning methods based on learning data representations. Learning can be supervised, semi-supervised or unsupervised.
Weighted average is the average of values which are scaled by importance. The weighted average of values is the sum of weights times values divided by the sum of the weights.
Online marketplace can be a type of e-commerce site where product or service information is provided by multiple third parties, whereas transactions are processed by the marketplace operator. Online marketplace can enable multichannel ecommerce.
Exemplary Methods
An online marketplace is provided. The online marketplace includes an information network accessed by computing or mobile or virtual reality devices. The online marketplace can enable two parties (e.g. a buyer and a seller) to enter into an online transaction. In one example, the online marketplace can enable transactions of used items and/or service(s). It is noted that other example embodiments, the online marketplace can also enable transaction that include first-hand/new items.
The purchase of a used and/or new items may be a high value financial transactions for a user. Accordingly, the trust (e.g. health) factor may be important for the transaction to be successful. For example, when two parties whether known to each other or unknown are trying to transact, the transaction suffers from information asymmetry and moral hazard. Because of this, trade velocity can slow down, as it takes longer to establish trust and transparency for transaction to take place. Accordingly, the online marketplace can include one or more trust factors that enable buyers to make more informed decisions.
One example of a process that increases a user's trust factor can be a health score. The health score can be based on account trust factors, such as auto inspection information, warranty information, verified seller information, attractiveness of pricing for buyer information, and/or level of disclosures by the sellers. The health score can enable buyers to develop more objective assessment and comfort around the items they are considering buying. The information can also enable sellers who want to adopt best practices and offer nothing but the best to the buyer community. Seller listings with higher health scores can be automatically ranked higher in the online marketplace.
In step 102, a health score can be associated with a listing of an item on website managed by the online market that implements process 100. In step 104, these parameter values can be divided in a specified number of data buckets (e.g. four (4) buckets). Example data buckets can include, inter alia: a transparency score, a seller score, health score, a pricing score, etc. In step 106, these data buckets (e.g. the four buckets) can have different configurable weights. The weights can be determined by several factors. Additionally, geography, marketplace, and/or seasonality can also be taken into account. For example, during a festive season in India, pricing score can have a higher weight. The parameter values can also have configurable weightages. In 108, the health score calculated on the aggregated scores of the various data buckets mentioned above. The aggregated score can be the full-circle health score. The full-circle health score can capture the three-hundred and sixty (360) degree information revolving around trust of the listing in any marketplace. It is noted that process 100 (as well as other processes and systems provided herein) can be expanded to any category possible in the marketplace.
Additional Exemplary Computer Architecture and Systems
In step 902, process 900 implements data collection and storage (e.g. Online vehicle Marketplace Listing Data, Sellers data. In one example, step 902, a Mongo™ Data base architecture is used for storage.
In step 904, process 900 can implement Data Cleaning and Clustering of parameters. This can include, inter alia: Handling Duplicate Values; Missing Values Treatment (e.g. the rows were either deleted, replaced with modes or means based on the suitability for each of the column, etc.); finding correlation between the parameters to identify influencing parameters for scoring; etc. Example databases utilized can include, inter alia: data set for calculating transparency score 906; data set for calculating seller score 908; data set for calculating health score 910; data set for calculating pricing score 912; etc. In step 914, a Weight calculation module can be implemented.
Weight calculation module can calculate the following parameters. The pricing score can include, inter alia: % variation from OBV; vehicle condition; % variation from medium; s3% of listing below your price; etc. The health score can include, inter alia: kilometers driven per year; warranty and certification; test drive; returns; seller declaration; etc. The seller score can include, inter alia: verified seller (email and/or phone; past transaction history availability; etc. The transparency score can include, inter alia: number of images uploaded; quality of images; description; video uploaded; history report availability; basic factors inputs availability; key factors inputs availability's copy of service logs; copy of insurance; etc.
In step 916, process 900 can use the transactional data to update weights. This can utilize a list of transaction(s) of the listing through marketplace 922. Transaction(s) of the listing through marketplace 922 can be developed as follows. In step 924, a new vehicle listing request can be placed on the marketplace. In step 920, process 900 can implement a weight matrix and FCTS score calculation module 920. In step 918, process 900 can implement a score assignment to the product on marketplace listing.
In step 1008, process 1000 can generate the weights using multivariate linear regression technique. For example:
Y=[y]
[X]=[S1 . . . Sn, R1 . . . Rn, Q1 . . . Qn, P1 . . . Pn]*[W1 . . . Wn]+[ε . . . εn]
where Y is the output required,
[X]=nth order matrix of total number of Independent variables.
[ε]=matrix of residual function.
ε=Σ (observed value—predicted value).
To Calculate the saturation point for residual function gradient descent method is applied using partial derivative, which is defined as:
∂/∂ε=1/n*Σi(y−(X*W+εi))2
where the value of “i” is varying from 0, n.
To achieve the value for each residual point above equation is iterated from 0 to n.
In step 1010, process 1000 calculates Weight(Z) of individual bucket. For example, parameters belongs to a bucket (e.g. Health Score) are scaled to create the overall bucket score of ten (10).
In step 1012, a Full Circle Trust Score (FCTS) score calculation is calculated. This can be total score is calculated using the equation/weights calculated in step 1008.
In step 1014, process 1000 provides a feedback system to update the weights. This can be implemented, whenever a transaction happens in the online e-commerce marketplace. Transaction data can be pushed in regular interval to re-calculate the weights.
Additional Discussion of FCTS
It is noted that the FCTS is based on the technologies, systems and algorithms that consider trust factors that are important while buying used automobiles, such as, auto inspection, warranty, verified seller, attractiveness of pricing for buyer, and level of disclosures by the sellers. The FCTS can enable buyers to develop more objective assessment and comfort around the vehicles they are considering buying. The FCTS is equally attractive for sellers who want to adopt best practices and offer nothing but the best to the buyer community. Seller listings with higher trust score can automatically emerge as winners on the online vehicle marketplace.
The FCTS can have the following components. A transparency score can be provided. This score is based on listing details disclosed by Seller, such as Number of Images, Quality of Images, Description, etc.
A pricing score can be provided. This score is based on the listing price relative to market price of similar vehicles and an algorithmic pricing engine (e.g. Orange Book Value, etc.).
A seller score can be provided. Every seller is unique, hence this score is based on seller related factors such as the Ratings given to them by buyers, whether the seller is verified or not, etc.
A health score can be provided. Every seller may be trustworthy, but this score is based on factors which enhance trust on the vehicle such as inspection report, warranty, Kms driven etc.
The FCTS can have the following elements. Verified sellers can be utilized in a quantified manner. Every information is fully verified, and the sellers are issued Verified Seller badge that are visible to buyer. Also, the system cross verifies more detailed information for professional sellers/dealers.
An inspection report is provided. The system offers independent, objective and unbiased automobile inspection reports. These reports are independent and follow inspection methodology, thus do not favor a seller. A reputed partner of the system performs the inspection and prepares the report.
A seller declaration is provided. For every item that is being sold by an individual or professional seller/dealer, the system asks every seller to fill a seller declaration form on self-disclosure basis. This form is uniquely designed so that seller can disclose any unknown problems in the automobile they are selling. This process adds great level of transparency in the buying process for the buyer. In case the seller is not truthful or fully transparent, the inspection report can reveal whether the seller declaration is passing or failing.
A set of listings are provided. The listing provides for higher trust, safety and convenience. Ratings and/or reviews are provided. The buyer can read product and seller ratings and reviews to make more informed decision. The buyer can also rate the seller from whom he bought an item or can rate the product bought. If a buyer does not have a good experience or did not like the product, he can rate the seller or the product accordingly. Because all users get to see the ratings and reviews for all the seller and the products, it makes buyers more informed.
CONCLUSIONAlthough the present embodiments have been described with reference to specific example embodiments, various modifications and changes can be made to these embodiments without departing from the broader spirit and scope of the various embodiments. For example, the various devices, modules, etc. described herein can be enabled and operated using hardware circuitry, firmware, software or any combination of hardware, firmware, and software (e.g., embodied in a machine-readable medium).
In addition, it can be appreciated that the various operations, processes, and methods disclosed herein can be embodied in a machine-readable medium and/or a machine accessible medium compatible with a data processing system (e.g., a computer system), and can be performed in any order (e.g., including using means for achieving the various operations). Accordingly, the specification and drawings are to be regarded in an illustrative rather than a restrictive sense. In some embodiments, the machine-readable medium can be a non-transitory form of machine-readable medium.
Claims
1. A computerized method of providing health scores in an online marketplace comprising:
- providing, with at least one computer processor, an item listing, wherein the item listing is listed in an e-commerce marketplace;
- identifying the item listing into a set of listing parameters, wherein each listing parameter comprises one or more listing images and one or more terms that are descriptive of the item listing, wherein the listing parameter is based on an image of an automobile uploaded by a user;
- based on historical sales data, calculating a set of weights assignments of the set of listing parameters, with a Multivariate Regression model by:
- splitting the historical sales data into a training set and a test set,
- generating the weights assignments using a multivariate linear regression process, and
- applying the weights assignments to the listing parameters; and
- using the listing parameters to:
- determine a trust score for the item listing, wherein the trust score is based on a number of images of an item in the item listing, a quality score of the number of images, and a description score of a description of the item, wherein the trust score is determined using a specified machine learning algorithm comprising at least one artificial neural network, and, wherein the trust score is further based on a set of basic facts about the item, a set of key factors of the item, and a list of available options for purchasing the item;
- determine a pricing score, wherein the pricing score is based on a percentage variation from a geographically relevant item valuation and an item research service, and a condition of the item, and wherein the pricing score is based on a listing price relative to a market price of a similar automobile and an algorithmic pricing engine, and wherein the pricing score is further based on a percentage variation from median price of a set of items that are in a same class as the item and a percentage of other current item listings of the set of items that are below a quoted price in the item listing;
- determine a seller score, wherein the seller score is based on the verified status of the seller, a seller rating of the seller, and a percentage of positive feedback, and wherein every seller is unique, hence this score is based on seller related factors comprising the Ratings given to the seller by buyers, whether the seller is verified or not, and, wherein the seller score is further based on a seller engagement score, a showroom score, and a dealership score; and
- determine a health score, wherein the health score is based on a seller declaration, a service log, a verification that the item is insured, and a verification that the item is properly registered, and wherein the health score is based on factors which enhance trust on the automobile including an inspection report and a warranty,
- wherein the item comprises a used automobile, and
- wherein the pricing score is further based on a percentage variation from median price of a set of items that are in a same class as the item and a percentage of other current item listings of the set of items that are below a quoted price in the item listing.
2. The computerized method of claim 1 further comprising:
- generating a full-health score based on a weighted average of the trust score, the pricing score, the seller score and the health score.
3. A server system comprising:
- a processor configured to execute instructions;
- a memory containing instructions which, when executed on the processor, cause the processor to perform operations that: provide, with at least one computer processor, an item listing, wherein the item listing is listed in an e-commerce marketplace; identify the item listing into a set of listing parameters, wherein each listing parameter comprises one or more listing images and one or more terms that are descriptive of the item listing, wherein the listing parameter is based on an image of the automobile uploaded by a user; based on historical sales data, calculate a set of weights assignments of the set of listing parameters values with a Multivariate Regression model by: splitting the historical sales data into a training set and a test set, generating the weights assignments using a multivariate linear regression process, and applying the weights assignments to the listing parameters; and using the listing parameters to: determine a trust score for the item listing, wherein the trust score is based on a number of images of an item in the item listing, a quality score of the number of images, and a description score of a description of the item, wherein the trust score is determined using a specified machine learning algorithm comprising at least one artificial neural network, and wherein the trust score is further based on a set of basic facts about the item, a set of key factors of the item, and a list of available options for purchasing the item; determine a pricing score, wherein the pricing score is based on a percentage variation from a geographically relevant item valuation and an item research service, and a condition of the item, and wherein the pricing score is based on a listing price relative to a market price of a similar automobile and an algorithmic pricing engine, and wherein the pricing score is further based on a percentage variation from median price of a set of items that are in a same class as the item and a percentage of other current item listings of the set of items that are below a quoted price in the item listing; determine a seller score, wherein the seller score is based on the verified status of the seller, a seller rating of the seller, and a percentage of positive feedback, and wherein every seller is unique, hence this score is based on seller related factors comprising the Ratings given to the seller by buyers, whether the seller is verified or not, and wherein the seller score is further based on a seller engagement score, a showroom score, and a dealership score; and determine a health score, wherein the health score is based on a seller declaration, a service log, a verification that the item is insured, and a verification that the item is properly registered, and wherein the health score is based on factors which enhance trust on the automobile including an inspection report and a warranty, wherein the item comprises a used automobile, and wherein the pricing score is further based on a percentage variation from median price of a set of items that are in a same class as the item and a percentage of other current item listings of the set of items that are below a quoted price in the item listing.
4. The server system of claim 3, wherein the trust score is further based on a set of basic facts about the item, a set of key factors of the item, and a list of available options for purchasing the item.
5. The server system of claim 4, wherein the seller score is further based on a seller engagement score, a showroom score, and a dealership score.
8005759 | August 23, 2011 | Hirtenstein |
20070162293 | July 12, 2007 | Malkon |
20080046383 | February 21, 2008 | Hirtenstein |
20170337573 | November 23, 2017 | Toprak |
Type: Grant
Filed: Sep 29, 2019
Date of Patent: Aug 30, 2022
Patent Publication Number: 20200160417
Inventor: Sandeep Aggarwal (gurgaon)
Primary Examiner: Nicholas D Rosen
Application Number: 16/587,046
International Classification: G06Q 30/06 (20120101); G06Q 30/02 (20120101); G06K 9/62 (20220101); G06N 20/00 (20190101);