Patents Assigned to Oxylabs, UAB
-
Patent number: 12294628Abstract: Proxy servers within a service provider infrastructure are enabled to maintain multiple persistent connections among themselves and to exchange data bi-directionally in an unsolicited manner. Specifically, exit proxy servers are enabled to request their respective proxy supernodes to update the already existing network connection to support WebSocket communication channels. Accordingly, the respective proxy supernodes are enabled to update the network connection with the exit proxy servers to support WebSocket communication channels. A single instance of a proxy supernode and an exit proxy server can maintain multiple WebSocket communication channels with each other. By utilizing the said WebSocket communication channels, the proxy supernode and the exit proxy servers can exchange data with each other simultaneously without any data losses.Type: GrantFiled: January 26, 2024Date of Patent: May 6, 2025Assignee: Oxylabs, UABInventor: Gediminas Dagys
-
Patent number: 12287837Abstract: Disclosed herein are system, method, and computer program product embodiments for improving web scraping technology by using machine learning to generate parsing expressions. A system receives a request to identify an element in a first document at a target web page. The system downloads and modifies the first document by adding an index value as an attribute to a tag for the element. A query is submitted to a large language model (LLM), including the modified first document, a description of the element, and a request asking the LLM to identify the element based on the description. The system obtains, from the LLM, the index value assigned to the element. The system generates an expression defining a path to the element in the first document using the index returned by the large language model. The system downloads a second document, and parses data of a second element using the expression.Type: GrantFiled: September 10, 2024Date of Patent: April 29, 2025Assignee: Oxylabs, UABInventors: Karolis Kluonaitis, Martynas Juravicius, Andrius Kuksta
-
Publication number: 20250124090Abstract: Embodiments disclose a system that allows for improved generation of web requests for scraping that, because of the nature of the requests and time and manner they are sent out, appear more organic, as in human generated, than conventional automated scraping systems. The system then manages how a client request to scrape a target website is made to the site, masking the request in a manner that makes it appear to the Web server as if the request is not generated by an automated system. In this way, by appearing more organic, Web servers may be less likely to block requests from the disclosed system or may take longer to block requests from the disclosed system. By avoiding Web servers blocking requests and extending the lifetime of IP proxies before they are blocked, embodiments can use a limited IP proxy address space more efficiently.Type: ApplicationFiled: October 31, 2024Publication date: April 17, 2025Applicant: Oxylabs, UABInventors: Eivydas VILCINSKAS, Arnas PETRUŠKEVICIUS, Giedrius STALIORAITIS, Martynas JURAVICIUS, Rimantas STANKEVICIUS
-
Publication number: 20240406466Abstract: Embodiments disclose a system that allows for improved generation of web requests for scraping that, because of the nature of the requests and time and manner they are sent out, appear more organic, as in human generated, than conventional automated scraping systems. The system then manages how a client request to scrape a target website is made to the site, masking the request in a manner that makes it appear to the Web server as if the request is not generated by an automated system. In this way, by appearing more organic, Web servers may be less likely to block requests from the disclosed system or may take longer to block requests from the disclosed system. By avoiding Web servers blocking requests and extending the lifetime of IP proxies before they are blocked, embodiments can use a limited IP proxy address space more efficiently.Type: ApplicationFiled: August 16, 2024Publication date: December 5, 2024Applicant: Oxylabs, UABInventor: Giedrius STALIORAITIS
-
Patent number: 12149755Abstract: Embodiments disclose a system that allows for improved generation of web requests for scraping that, because of the nature of the requests and time and manner they are sent out, appear more organic, as in human generated, than conventional automated scraping systems. The system then manages how a client request to scrape a target website is made to the site, masking the request in a manner that makes it appear to the Web server as if the request is not generated by an automated system. In this way, by appearing more organic, Web servers may be less likely to block requests from the disclosed system or may take longer to block requests from the disclosed system. By avoiding Web servers blocking requests and extending the lifetime of IP proxies before they are blocked, embodiments can use a limited IP proxy address space more efficiently.Type: GrantFiled: April 14, 2022Date of Patent: November 19, 2024Assignee: Oxylabs, UABInventor: Giedrius Stalioraitis
-
Publication number: 20240356920Abstract: Systems and methods to manage and efficiently perform authorization of multiple proxy clients are disclosed. Furthermore, systems and methods to measure and check whether the web traffic of one or more client devices has reached a permissible limit of web traffic assigned by the proxy service provider. Specifically, a proxy is configured to gather and save authorization information of one or more clients within its memory. Therefore, the proxy server can verify and authorize one or more clients by utilizing the data from its memory. Furthermore, the proxy is configured to measure and report the utilized web traffic of one or more client devices to a messaging platform. In another aspect, systems and methods to check whether one or more client devices have reached a permissible amount of web traffic limit are disclosed.Type: ApplicationFiled: June 28, 2024Publication date: October 24, 2024Applicant: Oxylabs UABInventors: Miroslav KOZLOVSKI, Gytis CIUTA, Valdemar KARASEVIC
-
Publication number: 20240176829Abstract: Embodiments disclose a system that allows for improved generation of web requests for scraping that, because of the nature of the requests and time and manner they are sent out, appear more organic, as in human generated, than conventional automated scraping systems. The system then manages how a client request to scrape a target website is made to the site, masking the request in a manner that makes it appear to the Web server as if the request is not generated by an automated system. In this way, by appearing more organic, Web servers may be less likely to block requests from the disclosed system or may take longer to block requests from the disclosed system. By avoiding Web servers blocking requests and extending the lifetime of IP proxies before they are blocked, embodiments can use a limited IP proxy address space more efficiently.Type: ApplicationFiled: December 11, 2023Publication date: May 30, 2024Applicant: Oxylabs, UABInventors: Eivydas VILCINSKAS, Arnas PETRUŠKEVICIUS, Giedrius STALIORAITIS, Martynas JURAVICIUS, Rimantas STANKEVICIUS
-
Publication number: 20240179092Abstract: Traffic services for network addresses may be provided within threads executing within a main process for managing the traffic services. The threads may share resources within the main process, reducing the computing resources consumed to provide traffic services to large pools of network addresses. According to one embodiment, a method may include executing a main process for managing traffic services; determining, by the main process, a configuration specifying at least one or more destination addresses; instantiating, by the main process, one or more traffic service (TS) threads for the one or more destination addresses; and/or processing, by the one or more traffic service (TS) threads, inbound traffic for the corresponding one or more destination addresses. Other aspects and embodiments for traffic management are also disclosed.Type: ApplicationFiled: February 2, 2024Publication date: May 30, 2024Applicant: Oxylabs, UABInventors: Gediminas Dagys, Paulius Vitkus
-
Publication number: 20240171640Abstract: Proxy servers within a service provider infrastructure are enabled to maintain multiple persistent connections among themselves and to exchange data bi-directionally in an unsolicited manner. Specifically, exit proxy servers are enabled to request their respective proxy supernodes to update the already existing network connection to support WebSocket communication channels. Accordingly, the respective proxy supernodes are enabled to update the network connection with the exit proxy servers to support WebSocket communication channels. A single instance of a proxy supernode and an exit proxy server can maintain multiple WebSocket communication channels with each other. By utilizing the said WebSocket communication channels, the proxy supernode and the exit proxy servers can exchange data with each other simultaneously without any data losses.Type: ApplicationFiled: January 26, 2024Publication date: May 23, 2024Applicant: Oxylabs, UABInventor: Gediminas DAGYS
-
Publication number: 20240104106Abstract: A parsing facility within a service provider infrastructure can navigate through source documents of target web pages and mine a specific list of target data by utilizing multiple parsing frames received from an external computing resource and/or system. The parsing facility receives a series of a plurality of parsing frames at random intermittent intervals. The parsing facility can store each of the plurality of parsing frames within its internal storage and learns the differences between each of the plurality of parsing frames. After learning the differences, the parsing facility can recognize appropriate parsing frames to locate and mine each target data from the source documents. The parsing facility can mine data from source documents by using each of the plurality of parsing frames for every mining cycle, thereby effectively managing the reception and usage of multiple parsing frames without any errors or faults.Type: ApplicationFiled: September 27, 2022Publication date: March 28, 2024Applicant: Oxylabs, UABInventor: Tadas MALINAUSKAS
-
Publication number: 20240104145Abstract: Embodiments relate to scraping web content. When scraping data, the target website sometimes redirects to different URLs within its domain. The different URLs represent the same context. Embodiments use a graph ontology to identify which redirected URLs represent the same page.Type: ApplicationFiled: September 22, 2022Publication date: March 28, 2024Applicant: Oxylabs UABInventor: Tadas BARZDZIUS
-
Patent number: 11929926Abstract: Traffic services for network addresses may be provided within threads executing within a main process for managing the traffic services. The threads may share resources within the main process, reducing the computing resources consumed to provide traffic services to large pools of network addresses. According to one embodiment, a method may include executing a main process for managing traffic services; determining, by the main process, a configuration specifying at least one or more destination addresses; instantiating, by the main process, one or more traffic service (TS) threads for the one or more destination addresses; and/or processing, by the one or more traffic service (TS) threads, inbound traffic for the corresponding one or more destination addresses. Other aspects and embodiments for traffic management are also disclosed.Type: GrantFiled: September 13, 2022Date of Patent: March 12, 2024Assignee: Oxylabs, UABInventors: Gediminas Dagys, Paulius Vitkus
-
Publication number: 20240080243Abstract: Systems and methods to manage and efficiently implement functional proxy services are disclosed. In the proxy services, a single instance of exit-node is connected to at least two or multiple supernodes at any given time. One of the plurality of supernodes is configured to ping and send diagnostic requests to the connected exit-node through a network. The exit-node is directed to send the pong message and diagnostic response data to a different supernode from among the plurality of supernodes connected to the exit-node. Likewise, a client's request is received by an element of the proxy service provider and forwarded to a specific supernode capable of forwarding the client's request to the exit-node. After performing the client's request, the exit-node returns response data to a different supernode from among the plurality of supernodes connected to the exit-node.Type: ApplicationFiled: April 26, 2023Publication date: March 7, 2024Applicant: Oxylabs, UABInventors: Miroslav KOZLOVSKI, Arnas PETRUSKEVICIUS, Valdas PILKAUSKAS
-
Publication number: 20240048638Abstract: Systems and methods herein provide for a proxy infrastructure. In the proxy infrastructure, a network element (e.g., a supernode) is connected with a plurality of exit nodes. At one of a plurality of messenger units of the proxy infrastructure, a proxy protocol request is received directly from a client computing device. The proxy protocol request specifies a request and a target. In response the proxy protocol request, a selection is made between one between one of the plurality of exit nodes. A message with the request is sent from the messenger to the supernode connected with the selected exit node. Finally, the message is sent from the supernode to the selected exit node to forward the request to the target.Type: ApplicationFiled: October 6, 2023Publication date: February 8, 2024Applicant: Oxylabs, UABInventors: Valdas PILKAUSKAS, Miroslav KOZLOVSKI, Gytis CIUTA
-
Publication number: 20240022635Abstract: Proxy servers within a service provider infrastructure are enabled to maintain multiple persistent connections among themselves and to exchange data bi-directionally in an unsolicited manner. Specifically, exit proxy servers are enabled to request their respective proxy supernodes to update the already existing network connection to support Web Socket communication channels. Accordingly, the respective proxy supernodes are enabled to update the network connection with the exit proxy servers to support WebSocket communication channels. A single instance of a proxy supernode and an exit proxy server can maintain multiple Web Socket communication channels with each other. By utilizing the said Web Socket communication channels, the proxy supernode and the exit proxy servers can exchange data with each other simultaneously without any data losses.Type: ApplicationFiled: April 10, 2023Publication date: January 18, 2024Applicant: Oxylabs, UABInventor: Gediminas DAGYS
-
Publication number: 20230412701Abstract: Systems and methods for coordinating network connectivity and communication between proxy servers, exit-nodes and client modules are disclosed. In one aspect, the proxy infrastructure enables network connectivity between exit-nodes and proxy-nodes without the need of any proxy-gateways or middleware entities to delegate the connections. Proxy-nodes in the proxy infrastructure accept connections with exit-nodes based on geographical proximity, proxy-node metrics, such as server loads and clients' frequent preferences. Further, a single instance of proxy-node can communicate and instruct another instance of proxy-node, to service the client request via a suitable exit-node. Further still, a single instance of proxy-node can communicate and instruct another instance of proxy-node to redirect a suitable exit-node towards the first proxy-node in order to service the client request.Type: ApplicationFiled: June 21, 2022Publication date: December 21, 2023Applicant: Oxylabs, UABInventors: Valdas PILKAUSKAS, Miroslav KOZLOVSKI
-
Publication number: 20230336793Abstract: Embodiments disclose a system that allows for improved generation of web requests for scraping that, because of the nature of the requests and time and manner they are sent out, appear more organic, as in human generated, than conventional automated scraping systems. The system then manages how a client request to scrape a target website is made to the site, masking the request in a manner that makes it appear to the Web server as if the request is not generated by an automated system. In this way, by appearing more organic, Web servers may be less likely to block requests from the disclosed system or may take longer to block requests from the disclosed system. By avoiding Web servers blocking requests and extending the lifetime of IP proxies before they are blocked, embodiments can use a limited IP proxy address space more efficiently.Type: ApplicationFiled: April 14, 2022Publication date: October 19, 2023Applicant: Oxylabs, UABInventor: Giedrius STALIORAITIS
-
Patent number: 11770457Abstract: System and method for efficiently implementing scalable, highly efficient decentralized proxy services through proxy infrastructures situated in different geo-locations. In one aspect, the systems and methods enable users from any geographical location to send requests to the geographically closest proxy infrastructure. One exemplary method described allows proxy infrastructures to gather, classify, and store metadata of exit nodes in its internal database. In another aspect, systems and methods described herein enable proxy infrastructures to select metadata of exit nodes from its internal database and forward requests from a user device to respective proxy servers or proxy supernodes to which the selected exit nodes are connected.Type: GrantFiled: September 27, 2022Date of Patent: September 26, 2023Assignee: Oxylabs, UABInventors: Valdas Pilkauskas, Karolis Dzekunskas
-
Patent number: 11741174Abstract: Embodiments disclose a system that allows for improved generation of web requests for scraping that, because of the nature of the requests and time and manner they are sent out, appear more organic, as in human generated, than conventional automated scraping systems. The system then manages how a client request to scrape a target website is made to the site, masking the request in a manner that makes it appear to the Web server as if the request is not generated by an automated system. In this way, by appearing more organic, Web servers may be less likely to block requests from the disclosed system or may take longer to block requests from the disclosed system. By avoiding Web servers blocking requests and extending the lifetime of IP proxies before they are blocked, embodiments can use a limited IP proxy address space more efficiently.Type: GrantFiled: March 21, 2022Date of Patent: August 29, 2023Assignee: Oxylabs, UABInventors: Eivydas Vilcinskas, Arnas Petru{hacek over (s)}kevicius, Giedrius Stalioraitis, Martynas Juravicius, Rimantas Stankevicius
-
Patent number: 11734367Abstract: Embodiments disclose a system that allows for improved generation of web requests for scraping that, because of the nature of the requests and time and manner they are sent out, appear more organic, as in human generated, than conventional automated scraping systems. The system then manages how a client request to scrape a target website is made to the site, masking the request in a manner that makes it appear to the Web server as if the request is not generated by an automated system. In this way, by appearing more organic, Web servers may be less likely to block requests from the disclosed system or may take longer to block requests from the disclosed system. By avoiding Web servers blocking requests and extending the lifetime of IP proxies before they are blocked, embodiments can use a limited IP proxy address space more efficiently.Type: GrantFiled: September 30, 2022Date of Patent: August 22, 2023Assignee: Oxylabs, UABInventors: Eivydas Vilcinskas, Arnas Petruskevicius, Giedrius Stalioraitis, Martynas Juravicius, Rimantas Stankevicius